This invention relates to nucleotide sequences which code for enzymes possessing cyclosporin synthetase-like activity and to methods for the production of cyclosporin and cyclosporin derivatives using these sequences.

The fungus Tolypocladium niveum (previously known as Tolypocladium inflatum GAMS) produces cyclosporins, a group of neutral cyclic peptides composed of eleven amino acids. Other fungi have been found which may form cyclosporins (Dreyfuss, 1986; Nakajima et al., 1989) but Tolypocladium niveum is the most important organism for the production of cyclosporins by fermentation. Cyclosporins exhibit remarkable biological effects: for example cyclosporin A, the main metabolite, is a potent immunosuppressant (Borel et al., 1976). An enzyme has been identified which catalyses the entire peptide biosynthesis of cyclosporin and is therefore called cyclosporin synthetase (Zocher et al., 1986, Billich and Zocher 1987). The biosynthesis proceeds non-ribosomally by a thiotemplate process, as has also been described for other peptide synthetases (Kleinkauf and von Döhren 1990). Each amino acid is first activated in the form of an adenylate, then bound in the form of a thioester and linked with the following amino acid to the peptide. In the case of cyclosporin A, seven of the amino acids, bound as thioesters, are methylated before they are linked to the preceding amino acid in a peptide bond. This methylation function is an integral constituent of the enzyme polypeptide (Lawen and Zocher 1990). Including the cyclisation reaction, cyclosporin synthetase performs at least 40 reactions.
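
The figure of at least 40 reactions can be reproduced from the steps described above. The following is a minimal sketch, assuming that each of the eleven amino acids undergoes one adenylation and one thioester formation, that seven residues are methylated, and that ten peptide bonds are formed before the single ring-closing cyclisation. The breakdown and the variable names are illustrative assumptions and are not taken from the cited publications.

```python
# Tally of the partial reactions of cyclosporin A biosynthesis.
# Assumption: one adenylation and one thioester formation per residue,
# ten peptide bonds for a linear undecapeptide, one cyclisation.
N_RESIDUES = 11     # amino acids in cyclosporin A
N_METHYLATED = 7    # residues methylated while bound as thioesters

reactions = {
    "adenylation": N_RESIDUES,
    "thioester formation": N_RESIDUES,
    "methylation": N_METHYLATED,
    "peptide bond formation": N_RESIDUES - 1,
    "cyclisation": 1,
}

total_reactions = sum(reactions.values())

for step, count in reactions.items():
    print(f"{step}: {count}")
print("total:", total_reactions)
```

Under these assumptions the tally agrees with the lower bound stated in the text.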

Cyclosporin A contains three non-proteinogenic amino acids: D-alanine in position 8, α-aminobutyric acid in position 2 and, in position 1, the unusual amino acid (4R)-4-[(E)-2-butenyl]-4-methyl-L-threonine (Bmt or C9 amino acid). Each of these three amino acids must be prepared by a biosynthetic pathway which is independent of the primary biosynthetic pathway. Cyclosporin synthetase does not possess an alanine racemase function (Kleinkauf and von Döhren 1990) and thus D-alanine cannot be produced by cyclosporin synthetase by epimerisation of enzyme-bound L-alanine, as is the case for other peptide antibiotics whose biosynthesis mechanism is known.

Although attempts have been made to isolate and characterize cyclosporin synthetase in terms of its amino acid sequence, this has not to date been possible because of the complexity and size of the enzyme. Hence it has not been possible to characterize the DNA coding for cyclosporin synthetase.

This invention provides a nucleotide sequence which codes for an enzyme possessing cyclosporin synthetase-like activity. In the present specification, an enzyme possessing cyclosporin synthetase-like activity is an enzyme which catalyses the peptide biosynthesis of cyclosporins and structurally related peptides and derivatives.

Preferably, the nucleotide sequence codes for cyclosporin synthetase or an enzyme which is at least 70% (for example, at least 80, 90 or 95%) homologous to it and which possesses cyclosporin synthetase-like activity.

Preferably, the nucleotide sequence codes for an enzyme which possesses cyclosporin synthetase-like activity and in which at least one amino acid recognition unit is different from that of cyclosporin synthetase.

Preferably, the nucleotide sequence comprises the sequence represented in Seq Id 1 or a sequence which hybridises to it under conditions of reduced stringency or, more preferably, stringent conditions. Stringent conditions include hybridisation at 42°C in 6 x SSPE, 50% formamide, 5 x Denhardt's solution, and 0.1% SDS and washing three times for 10 minutes in 2 x SSC, 0.1% SDS and twice for 30 minutes in 0.2 x SSC, 0.1% SDS at 65°C. Reduced stringency conditions include a washing temperature of 60°C. Even more preferably the nucleotide sequence codes for an enzyme having the amino acid sequence set out in Seq Id 2. The nucleotide sequence may have a restriction map as represented in figure 1.

In another aspect, the invention provides a recombinant vector containing a nucleotide sequence as defined above. The vector may include the endogenous promoter for cyclosporin synthetase or may include some other suitable promoter. A suitable promoter region is illustrated in Seq Id 7. The recombinant vector may be in the form of a plasmid, a cosmid, a P1-vector or a YAC-vector. The invention also extends to host cells carrying the vector. Preferably the host cell is a Tolypocladium niveum cell.

The invention also provides a process for the production of cyclosporin or a cyclosporin derivative, comprising cultivating a host cell as defined above and causing the host cell to produce the cyclosporin or cyclosporin derivative.

The invention also provides a method for the production of a cyclosporin derivative, comprising altering the DNA sequence coding for cyclosporin synthetase so that the enzyme causes the production of the cyclosporin derivative, placing the altered DNA sequence in a vector, transforming a host cell with the vector, and causing the host cell to produce the cyclosporin derivative. Preferably the DNA sequence coding for cyclosporin synthetase is altered by changing the fragments that code for amino acid recognition units. Alterations may be made using standard techniques such as those based on PCR procedures. Point deletions, mutations and insertions, as well as larger alterations, are possible.

This specification describes the isolation and characterisation of the gene for cyclosporin synthetase from Tolypocladium niveum and the use of the gene in genetically engineering cells, including Tolypocladium niveum cells. While a protocol for the isolation of cyclosporin synthetase from Tolypocladium niveum was published in 1990 (Lawen and Zocher 1990), it is not suitable for extracting large quantities of homogeneous enzyme in a short period of time. Also, in the publication, the synthetase was attributed an Mr of approximately 650,000 Dalton. It may, however, justifiably be assumed from sedimentation analyses with fluorescence-labelled protein (Lawen et al., 1992) and by extrapolation from the protein size of comparable enzymes that cyclosporin synthetase has an Mr of approximately 1,500 kDa. The enzyme occurs as a single polypeptide chain and cannot be decomposed into subunits by either denaturing or reducing agents (Lawen and Zocher 1990).

The enormous size of the enzyme means that a strategy for amino acid sequencing must be used which differs from the customary route. Substantially more homogeneous material is required than is generally used to perform fragmentation tests. It is for this reason that a protocol was developed for cyclosporin synthetase which may, in principle, also be applied to analogous enzymes from other microorganisms and which, in the practical example of the purification of the enzyme from Tolypocladium niveum (Example 1), gave rise to a substantial improvement in terms of yield and the amount of time required.

Purification may initially proceed according to customary processes. Cell disruption may be performed, for example, with a high pressure homogeniser or a glass bead mill, the cells being present in a moist or lyophilised state. If the cells are moist, pressure disruption is conveniently performed, for example with a Manton-Gaulin apparatus. Lyophilised cells are conveniently broken up by grinding in a mortar under liquid nitrogen. The crude extract so obtained is clarified by centrifugation. The nucleic acids are removed by precipitating them from the extract using customary reagents for this purpose, for example polyethyleneimine or protamine sulphate. The nucleic acid precipitation also removes fine suspended particles, which can disturb subsequent purification stages. The proteins may then be precipitated out of the clarified crude extract to provide the enzyme in a more concentrated form. The protein precipitation is customarily performed with ammonium sulphate. For cyclosporin synthetase, saturation to 50% is sufficient to achieve almost complete precipitation. After this step, the enzyme is in an enriched and highly concentrated state.

In principle, all chromatographic methods are suitable for further purification of the enzyme, such as ion-exchange chromatography and gel permeation chromatography. With very large proteins, gel permeation chromatography is particularly suitable as a very selective purification step. If the correct molecular sieve is chosen, an approximately 90% homogeneous protein preparation may be obtained in a single step. Analysis of purity is performed in SDS polyacrylamide gels (preferably 4-15% gradient gels).

The purification process used produces stable, at least 90% homogeneous, active enzyme preparations, as is necessary for characterisation of enzyme kinetics or protein chemistry. The protocol described in detail in Example 1 for Tolypocladium niveum, in comparison with the published method, reduces the time required from 4 days to 10 hours and increases the yield by approximately a factor of 4.

With a protein of this exceptional size, the requirement for amino acid sequences to identify the gene or gene product correctly is naturally greater than for an average-sized protein. Apart from the possibility of N-terminal blocking, it is also not possible to prepare a protein of this size in such a way that it is suitable for N-terminal sequencing. For these reasons, it is necessary to obtain a sufficient number of internal amino acid sequences.

However, when a protein of this size is fragmented, so many fragments are produced (theoretically approximately 700, assuming one cleavage every 20 amino acids) that the standard method of completely fragmenting the protein and purifying the fragments by high-pressure reversed-phase chromatography (HP-RPC) is not practicable. For this reason, fragmentation is performed under conditions which are sub-optimal for the relevant endoproteinases to give substantially larger fragments.
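
The estimate of approximately 700 fragments follows from the assumed Mr of about 1,500 kDa. The sketch below is illustrative only: the average residue mass of 110 Da is a commonly used rule of thumb and is an assumption here, as are the function and variable names.

```python
# Rough estimate of the number of fragments from a complete digest.
# Assumption: an average amino acid residue mass of about 110 Da.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_fragments(protein_mass_da, residues_per_cleavage,
                       residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Return the approximate number of fragments of a complete digest."""
    n_residues = protein_mass_da / residue_mass_da
    return n_residues / residues_per_cleavage


# Mr of approximately 1,500 kDa, one cleavage every 20 amino acids.
n_fragments = estimate_fragments(1_500_000, 20)
print(round(n_fragments))
```

With these inputs the result lies slightly below 700, which is consistent with the order of magnitude given in the text.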

Cyclosporin synthetase is cleaved by adjusting the pH value. In particular, cleavage into large fragments of up to 200 kDa is achieved by adjusting the pH value to approximately 7.5 in a HEPES buffer with the addition of EDTA and DTT. The fragments obtained in this manner may be isolated and enriched as is conventional, for example by using chromatography and electrophoresis, such as the combination of anion exchange chromatography on MonoQ with HP-RPC or the combination of MonoQ with SDS-polyacrylamide gel electrophoresis/electroblot.

The sub-optimal conditions are principally obtained by altering the buffer conditions, and possibly also by altering the cleavage temperature (see Example 3 as a possible variant). The fragments, which are nonetheless numerous, must each be isolated or enriched by two purification steps; in principle, any chromatographic and electrophoretic separation techniques may be used. In the case of cyclosporin synthetase fragments from Tolypocladium niveum, the combinations of anion exchange chromatography on MonoQ with HP-RPC (Examples 4 and 5) and MonoQ with SDS-polyacrylamide gel electrophoresis/electroblot (Examples 4 and 6) prove particularly advantageous.

The non-ribosomal biosynthetic pathway implies that the sequence of the cyclic peptide is determined by the corresponding arrangement of the amino acid activating domains. Each of these domains must perform analogous reactions, namely the activation of the amino acid by adenylation and binding in the form of a thioester. Hence it may be expected that recurrent, conserved motifs will be found in the protein sequence.

In fact, in previously analysed peptide synthetases, conserved regions within the sequences have been discovered, the number of which coincides with the number of amino acids to be activated: three for ACV synthetase (activates aminoadipic acid, cysteine and valine; Smith et al., 1990, MacCabe et al., 1991, Gutierrez et al., 1991); one each for gramicidine synthetase 1 (Kraetzschmar et al., 1989) and tyrocidine synthetase 1 (Weckermann et al., 1988); and four conserved regions in gramicidine synthetase 2, which activates the amino acids proline, valine, ornithine and leucine (Turgay et al., 1992).
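
The correspondence between the number of conserved regions and the number of activated amino acids can be written down as a small lookup. This is only a sketch restating the counts given above; the figure of eleven for cyclosporin synthetase is an extrapolation from the eleven amino acids of cyclosporin, and the dictionary names are hypothetical.

```python
# Conserved regions reported for previously analysed peptide synthetases.
conserved_regions = {
    "ACV synthetase": 3,
    "gramicidine synthetase 1": 1,
    "tyrocidine synthetase 1": 1,
    "gramicidine synthetase 2": 4,
}

# Activated amino acids, listed only where the text names them.
activated_amino_acids = {
    "ACV synthetase": ["aminoadipic acid", "cysteine", "valine"],
    "gramicidine synthetase 2": ["proline", "valine", "ornithine", "leucine"],
}

# One conserved region per activated amino acid.
for enzyme, amino_acids in activated_amino_acids.items():
    assert conserved_regions[enzyme] == len(amino_acids)

# Extrapolation (assumption): one activating domain per residue of the
# eleven-residue cyclosporin peptide.
expected_cyclosporin_domains = 11
print(expected_cyclosporin_domains)
```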
Maximally accurate identification and characterisation of such conserved regions of cyclosporin synthetase at both the enzymatic and genetic levels constitutes the basis for well-directed genetic engineering in terms of altering enzyme specificity for the in vivo production of cyclosporin variants. It is therefore useful to identify proteolytic fragments of cyclosporin synthetase which may be correlated with a partial function of the synthetase. The following correlations were made:
(1) a protein fragment with a methyl transferase function (the method on which this work is based is, in principle, applicable to all methyl transferases and is published in Yu et al., 1983; a first application to cyclosporin synthetase is published in Lawen and Zocher 1990); see Example 7;
(2) a protein fragment capable of activating L-alanine (Example 8).

The method used in Example 8 exploits the fact that when proteins are subjected to limited proteolytic cleavage, intact domains, inter alia, are cleaved off which, due to their correct spatial folding, are still capable of exercising their enzyme function to a limited extent. Theoretically, therefore, each amino acid activating domain may be identified with this method. The optimal conditions (for proteolytic cleavage and its timing in relation to amino acid activation) must, however, be determined by testing in each individual case. Moreover, unambiguous identification of a domain may be achieved only if the amino acid it activates occurs only once in the product.

The gene is isolated by DNA hybridisation with oligonucleotides specific to cyclosporin synthetase (Example 10). Whether a specific DNA fragment actually belongs to the cyclosporin synthetase gene is established by Northern hybridisation, since a non-transcribed neighbouring fragment does not hybridise with the corresponding RNA (Example 15). The DNA sequence of the cloned DNA of the cyclosporin synthetase gene is determined and compared with the amino acid partial fragments of cyclosporin synthetase (Examples 13 and 14).

Hence it is possible to transform Tolypocladium niveum with the complete gene for cyclosporin synthetase. Among the transformants, strains may be found which contain several copies of this gene or copies with altered regulation. Those strains are selected which, in fermentation tests, display increased cyclosporin formation or can form the same quantity of cyclosporin over a shorter fermentation period.

It is also possible to select the transformed strains by the activity of the cyclosporin synthetase, independently of whether cyclosporin is formed in greater quantities or faster. The isolated cyclosporin synthetase gene can act as an analytical aid in order to determine whether a specific strain of Tolypocladium niveum has a high concentration of the mRNA or not (Example 15). Such strains may then be subjected to conventional mutagenesis and strain selection. Even if the initial strain used for transformation is not limited in its cyclosporin synthetase activity, a strain is provided in this way which potentially allows greater cyclosporin formation. The combination of classical genetics (mutation and strain selection) with molecular genetics (transformation with isolated genes) allows the isolation of improved strains which could not be achieved by either of the two methods alone: not by classical genetics because a double mutation is extremely rare in a single selection stage; not by molecular genetics because in some circumstances an unknown factor has a limiting effect.

A further use of the isolated gene is gene-specific mutagenesis. Instead of producing mutations in the entire genome, and therefore also altering many uninvolved genes, the isolated gene alone is mutated using suitable methods (Sambrook et al., 1989) and then transformed into Tolypocladium niveum (Example 17). Among the transformants, the proportion of mutants in the cyclosporin synthetase gene is higher than with mutagenesis of the fungus. Mutants which form specific cyclosporins in greater or reduced quantities may be found more frequently than with conventional mutagenesis.

By internal sequence comparisons of the derived amino acid sequences (Example 14c) and the correlation of specific partial sequences (Example 8 and Example 9 or Example 14ab), domains of the cyclosporin synthetase for the activation of the individual amino acids may be localised (as described above for other non-ribosomal peptide synthetases). By this means, well-directed mutagenesis of the cyclosporin synthetase gene may be performed by interchanging the gene regions of individual domains, by deliberately removing a corresponding region, or by extending the cyclosporin synthetase gene by individual domains. After transformation of such mutated genes into Tolypocladium niveum, new cyclosporin variants may become accessible. The cloned gene may be used to produce strains of Tolypocladium niveum which no longer have an active cyclosporin synthetase gene. Such strains may be used for the production of D-alanine or Bmt by fermentation or act as recipient strains for in vitro modified cyclosporin synthetase genes. To this end, an inactive version produced in vitro is constructed for the transformation (Example 18).

When screening for microorganisms which can synthesise cyclosporins, it is necessary that the active metabolites are actually formed in sufficient quantity under the test conditions. Such substances may moreover have slightly changed characteristics and may for this reason alone be overlooked. Example 16 describes the use of the isolated cyclosporin synthetase gene to find microorganisms which contain the cyclosporin synthetase gene in their genome. These genes do not have to be active for this purpose. On the basis of these hybridisations, the corresponding genes may be isolated in a manner analogous to Examples 10, 11 and 12 and transformed into Tolypocladium niveum. A strain may be used to this end which no longer contains any active cyclosporin synthetase. This interspecific recombination cannot be achieved with other methods. As described in the preceding paragraph, such strains may be subjected to a screening programme. In this case, genetic variability is based on the introduced gene which hybridises with the cyclosporin synthetase gene.

The control sequences of the cyclosporin synthetase gene may also be used for the construction of plasmids. An example of a control sequence is that which occurs in synp4 (Example 12). The promoter may be fused with a readily detectable reporter gene, such as for example the β-glucuronidase gene (Tada et al., 1991). Strains of Tolypocladium niveum which are transformed with these plasmids not only permit the selection of regulatory mutants but also make it possible to measure and optimise promoter activity independently of other functions.

The following examples and figures illustrate the invention without, however, limiting it.

Figure 1: Restriction map of cyclosporin synthetase gene from Tolypocladium niveum cloned in λSYN3. The position of some restriction cleavage points is shown in relation to a scale (2.0, 4.0, 6.0, etc. kb). Among these, several partial fragments subcloned in plasmids are represented as rectangles (S5, E3, S3, etc.). If the corresponding rectangle is filled in, this means that the corresponding DNA fragment reacts with a high molecular weight RNA in Northern hybridisation (S5, E3, S3, E1, E2). Rectangles with lengthwise lines indicate that no bands were obtained in Northern hybridisation (E4, S2). Empty rectangles indicate that the DNA was not used as a probe (S4). The following two tables give the positions of the fragments (S5, H2, etc.) and enzyme restriction sites shown in figure 1 (in bp):



Start     End       Fragment Name
1         2500      S5
1300      3300      H2
2000      5400      E3
2500      5300      S3
4700      11750     H3
5300      8400      S4
5400      7000      E1
7000      9200      E2
9200      12100     E4
10250     13850     S2

Enzyme restriction sites:

Enzyme      Position
SalI        1
HindIII     1300
EcoRI       2000
SalI        2500
HindIII     3300
HindIII     3800
HindIII     4700
SalI        5300
EcoRI       5400
EcoRI       7000
SalI        8400
EcoRI       9200
SalI        10250
HindIII     11750
EcoRI       12100
SalI        13850
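
The two tables can be cross-checked against each other. The sketch below is a hedged illustration: it assumes that the first letter of each fragment name denotes the enzyme used to excise it (S for SalI, H for HindIII, E for EcoRI), which the text does not state explicitly, and all identifiers are hypothetical. Because the map is approximate, the lengths computed here are approximate as well.

```python
# Restriction sites for figure 1 (approximate positions in bp).
SITES = {
    "SalI": [1, 2500, 5300, 8400, 10250, 13850],
    "HindIII": [1300, 3300, 3800, 4700, 11750],
    "EcoRI": [2000, 5400, 7000, 9200, 12100],
}

# Subcloned fragments for figure 1 as (start, end) in bp.
FRAGMENTS = {
    "S5": (1, 2500), "H2": (1300, 3300), "E3": (2000, 5400),
    "S3": (2500, 5300), "H3": (4700, 11750), "S4": (5300, 8400),
    "E1": (5400, 7000), "E2": (7000, 9200), "E4": (9200, 12100),
    "S2": (10250, 13850),
}

# Assumption: the first letter of a fragment name identifies the enzyme.
ENZYME_BY_PREFIX = {"S": "SalI", "H": "HindIII", "E": "EcoRI"}


def is_adjacent_site_fragment(name):
    """True if the fragment spans two neighbouring sites of its enzyme."""
    start, end = FRAGMENTS[name]
    positions = sorted(SITES[ENZYME_BY_PREFIX[name[0]]])
    if start not in positions or end not in positions:
        return False
    return positions.index(end) - positions.index(start) == 1


fragment_lengths = {n: end - start for n, (start, end) in FRAGMENTS.items()}
consistent = {n: is_adjacent_site_fragment(n) for n in FRAGMENTS}

for name in FRAGMENTS:
    print(name, fragment_lengths[name], consistent[name])
```

Under the naming assumption, every listed fragment is bounded by two neighbouring sites of a single enzyme, so the two tables agree with each other.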











Figure 2: Restriction map of plasmid pSIM10. The construction and structure of the plasmid are described in Example 18. The positions are stated in bp. Nucleotides 4749-6865 are DNA from Tolypocladium niveum containing the promoter of the cyclophilin gene. Nucleotides 1-1761 contain the hygromycin phosphotransferase gene from plasmid pCSN44 (Staben et al., 1989). Nucleotides 1761-4714 are from plasmid pGEM7Zf (Promega Inc.).

Figure 3: Restriction map of plasmid pSIM11. Construction of the plasmid is described in Example 18. Nucleotides 4924 to 8553 are the 3.6 kb XhoI restriction fragment from the cyclosporin synthetase gene. Nucleotides 8548-10489 and 1-4929 are plasmid pSIM10 (figure 2).

Figure 4: Restriction map of plasmid pSIM12. Construction of the plasmid is described in Example 18. Nucleotides 4924 to 5727 are the 0.8 kb XhoI restriction fragment from the cyclosporin synthetase gene. Nucleotides 5722-7663 and 1-4929 are plasmid pSIM10 (figure 2).

Figure 5: Restriction map of cyclosporin synthetase gene from Tolypocladium niveum cloned in syncos13. The position of some restriction cleavage points is shown. The position of the part cloned in λsyn3 is marked with the crosshatched bar.

All the restriction maps shown in figures 1, 2, 3, 4 and 5 are only approximate reproductions of restriction cleavage points in DNA molecules. The distances as drawn are proportional to the actual distances, but the actual distances may be different. Not all restriction cleavage points are shown; it is possible for further cleavage points to be present.



Example 1: Isolation of active cyclosporin synthetase in electrophoretically homogeneous form:

The starting material used for the protein purification is Tolypocladium niveum, strain 7939/45 (Lawen et al., 1989). All steps are performed at a temperature between 0° and 4°C. 10 g of lyophilised mycelium is finely ground in a mortar with addition of liquid nitrogen and then suspended in buffer A (buffer A: 0.2 M HEPES pH 7.8, 0.3 M KCl, 4 mM EDTA, 40% (v/v) glycerol, 10 mM DTT). The suspension is carefully stirred over ice for 1 hour and then centrifuged for 10 min at 10,000 g to remove cell debris.

The supernatant is collected and nucleic acids are precipitated with polyethyleneimine (final concentration 
0.1%). The precipitate is removed by centrifugation for 10 min at 10,000 g. 

The supernatant is again collected and proteins are precipitated using a solution of ammonium sulphate (saturated) in buffer B (0.1 M HEPES pH 7.8, 4 mM EDTA, 15% (v/v) glycerol, 4 mM DTT) at room temperature. The solution is added dropwise to the supernatant up to a final concentration of 50% of saturation. The mixture is left to stand for a further 30 minutes to reach equilibrium. The precipitated proteins are collected by centrifugation for 30 minutes at 30,000 g. The pellet obtained is resolubilised to 10 ml in buffer B.
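
The volume of saturated ammonium sulphate solution needed to reach a given percentage of saturation can be estimated as follows. This is a minimal sketch, assuming that the volumes of supernatant and saturated solution are additive; the function name and the 100 ml supernatant volume are hypothetical and do not come from the example.

```python
def saturated_solution_volume(sample_volume_ml, target_saturation):
    """Volume of saturated solution to add to reach target_saturation.

    Assumes additive volumes, so that
    target = added / (sample + added)  =>  added = sample * target / (1 - target).
    """
    if not 0 <= target_saturation < 1:
        raise ValueError("target_saturation must be in the range [0, 1)")
    return sample_volume_ml * target_saturation / (1 - target_saturation)


# Hypothetical supernatant volume of 100 ml, brought to 50% of saturation.
volume_to_add = saturated_solution_volume(100.0, 0.5)
print(volume_to_add)
```

At 50% of saturation the formula reduces to adding one volume of saturated solution per volume of supernatant.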

The resolubilised pellet is then subjected to molecular sieve chromatography. The molecular sieve is a 
HW65-F Fractogel obtained from Merck; the column dimensions are 2.6 cm x 93 cm, and the volume is 494 
ml. The column is operated under fast performance liquid chromatography (FPLC) conditions. The flow rate 
is 2 ml/min, continuous under buffer B. The cyclosporin synthetase elutes under these conditions at an elution 
volume of 260 to 310 ml. Processing 10 g of lyophilised mycelium produces 50 mg of cyclosporin synthetase 
in electrophoretically homogeneous form within 10 hours. 
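
The column figures quoted above can be checked with a short calculation. The sketch below treats the column as an ideal cylinder and the flow rate as constant, both of which are simplifying assumptions; the function and variable names are hypothetical.

```python
import math


def column_volume_ml(diameter_cm, length_cm):
    """Volume of a cylindrical column in ml (1 cm^3 = 1 ml)."""
    return math.pi * (diameter_cm / 2) ** 2 * length_cm


bed_volume_ml = column_volume_ml(2.6, 93)   # stated as 494 ml

FLOW_RATE_ML_PER_MIN = 2.0
elution_start_min = 260 / FLOW_RATE_ML_PER_MIN
elution_end_min = 310 / FLOW_RATE_ML_PER_MIN

# Yield: 50 mg of enzyme from 10 g of lyophilised mycelium.
yield_mg_per_g = 50 / 10

print(round(bed_volume_ml), elution_start_min, elution_end_min, yield_mg_per_g)
```

On these assumptions the enzyme would elute roughly 130 to 155 minutes after the start of the run, at a little over half of the column volume.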

Example 2: Detection of enzymatic activity of cyclosporin synthetase:

80 µl of an enzyme sample in buffer B are incubated, in a total volume of 130 µl, with 3.5 mM ATP, 8 mM MgCl2, 10 mM DTT, 10 µM C9 acid, 690 µM of any other constituent amino acid and 100 µM S-adenosyl-methionine + 2 µCi of adenosyl-L-methionine-S-[methyl-3H] (75 Ci/mmol) for 1 hour at 22°C. Extraction and detection of the cyclosporin A formed are performed as described in Billich and Zocher 1987.

Example 3: Endoproteinase cleavages:

The following endoproteinases (Boehringer Mannheim, sequencing grade) are used: trypsin from bovine pancreas (cleaves after arginine and lysine); LysC from Lysobacter enzymogenes (cleaves after lysine); GluC = V8 from Staphylococcus aureus (cleaves after glutamic acid and aspartic acid).

The cleavages are not performed under the conditions recommended by the manufacturer, but rather under 'sub-optimal' conditions. The cyclosporin synthetase is incubated in its storage buffer (0.1 M HEPES pH 7.5, 4 mM EDTA, 4 mM DTT, 15% (w/v) glycerol) with protease in a ratio of 100 µg : 1 µg for 2 to 3 hours at 25°C. In this way, fragments of a size up to approximately 200 kDa are produced.

Example 4: MonoQ purification of fragments: 

Purification is performed using a commercially available MonoQ column (HR 5/5) obtained from PHARMACIA, at 4°C. The protease digested protein sample is diluted (1:5) in buffer 1 (20 mM HEPES pH 7.5, 2 mM EDTA, 2 mM DTT, 5% (w/v) glycerol) and applied to the column. The gradient elution of fragments is carried out in 20 ml of 0% to 100% buffer 2 (buffer 1 + 500 mM NaCl).

Example 5: HP-RPC purification of MonoQ fractions:

Purification is performed using a commercially available Nucleosil 300A-C4-5µ column of dimensions 85 x 4.5 mm. The MonoQ fraction sample is diluted (1:5) in buffer 1 (5% acetonitrile, 0.1% TFA) and applied at a flow rate of 1 ml/min and room temperature. Gradient elution is carried out in 85 minutes from 0% to 100% buffer 2 (90% acetonitrile, 0.1% TFA).
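
The acetonitrile concentration reached at a given time in this gradient can be sketched as below. The calculation assumes a strictly linear gradient with ideal mixing and ignores system dead volume; these assumptions and the function name are illustrative and are not stated in the example.

```python
def acetonitrile_percent(t_min, gradient_min=85.0,
                         start_percent=5.0, end_percent=90.0):
    """Acetonitrile content (%) at time t_min of a linear gradient.

    Buffer 1 contains 5% and buffer 2 contains 90% acetonitrile;
    the gradient runs from 0% to 100% buffer 2 over gradient_min minutes.
    """
    fraction_b = min(max(t_min / gradient_min, 0.0), 1.0)
    return start_percent + (end_percent - start_percent) * fraction_b


for t in (0, 40, 85):
    print(t, round(acetonitrile_percent(t), 1))
```

With these buffer compositions the acetonitrile content rises by about one percentage point per minute.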

Example 6: SDS-PAGE/Blot purification of MonoQ fractions: 

SDS-PAGE is performed according to Laemmli (1970). Thioglycolic acid (2 mM) is added to the electrophoresis buffer in order to prevent the N termini being blocked by residual radicals from the polymerisation reaction. The MonoQ fractions are used after denaturation with SDS for the electrophoresis. For sequencing, the proteins are blotted out of the gel onto glass fibre membranes ("Glassybond" from Biometra) using the semi-dry method.

Example 7: Protein fragment with methyl transferase activity: identification and purification

The active centre of methyl transferases may be crosslinked with its substrate S-adenosyl-methionine by UV irradiation. This may be exploited by providing a radioactive substrate and so achieving radioactive labelling of the enzyme (Yu et al., 1983). This method, which is also known as "photoaffinity labelling", has been used on cyclosporin synthetase (Lawen and Zocher 1990) and it is possible to show that several labelled protein fragments are produced upon subsequent protease digestion. A labelled fragment is enriched by a combination of the methods described in Examples 4 and 6 and so made accessible to sequencing (see Example 9: aa4). This fragment has a size of approximately 47,000 Dalton.

Example 8: Amino acid activating protein fragments: identification and purification

Protein fragments that have the capacity to activate an amino acid are identified by loading the synthetase with radioactively labelled amino acid in the simultaneous presence of an endoproteinase. Approximately 500 µg of purified cyclosporin synthetase are incubated with 25 mM of ATP, 30 mM MgCl2 and 5 µCi of 14C-L-alanine and are simultaneously treated with, for example, endoproteinase LysC. The reaction is arrested after 3 hours by precipitation of the proteins with TCA. The fragments are resolubilised in a sample buffer for SDS-PAGE, omitting reducing agents. Half of the batch is subjected to SDS-PAGE and the labelled protein fragment is detected by autoradiography of the gel after amplification in "amplify solution" (from NEN) and drying. A fragment with an Mr of approximately 140,000 Dalton is identified and enriched by a combination of the methods described in Examples 4 and 6. The amino acid sequence is given in Example 9: aa13.
Example 9: Amino acid partial sequences of cyclosporin synthetase:

The following partial sequences are obtained from cyclosporin synthetase obtained from Example 6.

aa1: amino acids
aa2: amino acids
aa3: amino acids
aa4: amino acids
aa5: amino acids
aaS: amino acids
aaS: amino acids
aa9: amino acids
aa10: amino acids
aa12: amino acids

Example 10: Isolation of λ-clones which hybridise with an oligonucleotide specific to cyclosporin synthetase

a) Construction of a genomic λ-gene library from Tolypocladium niveum

DNA is isolated from the mycelium of a culture of Tolypocladium niveum grown in medium 1 [50 g/l of maltose, 10 g/l of casein peptone (digested with trypsin, Fluka), 5 g/l of KH2PO4 and 2.5 g/l of KCl; the pH value is adjusted to 5.6 with phosphoric acid]. 4 ml of a spore suspension of Tolypocladium niveum strain ATCC 34921 with 4 x 10^8 spores per ml are added to 200 ml of medium 1 in a 1 l conical flask and are shaken for 72 hours at 25°C and 250 rpm. The mycelium is filtered off with a Buchner funnel, washed with 10 mM Tris-Cl pH 8.0, 1 mM EDTA and ground to a fine powder under liquid nitrogen. Nuclei are isolated from 40 g of moist mycelial mass and are then lysed; the DNA is purified by CsCl-EtBr centrifugation. This method is described in Jofuku and Goldberg (1988). 4.3 mg of DNA are obtained, which, in a 0.5% agarose gel, produces a band exhibiting lower mobility than λ-DNA.

40 µg of the DNA are incubated with 1.4 units of the restriction enzyme Sau3A in 10 mM tris-Cl pH 7.5,
10 mM MgCl2, 1 mM DTE, 50 mM NaCl for 60 minutes at 37°C and then 10 minutes at 65°C. The extent
of cleavage is verified on an agarose gel: part of the DNA is between 10 and 20 kb in size. The DNA is then
applied to two NaCl gradients, which are produced by freezing and slowly thawing at 4°C two Beckman
SW28.1 ultracentrifuge microtubes with 20% NaCl in TE (10 mM tris-Cl, pH 8.0, 1 mM EDTA). The
microtubes are centrifuged for 16 hours at 14,000 rpm in a Beckman L8M ultracentrifuge in rotor SW28.1.
The contents of the microtubes are fractionated. Fractions with DNA larger than 10 kb are combined and
dialysed against TE. After concentration of the DNA to 500 µg/ml, the DNA is combined with
λEMBL3-DNA (Promega Inc.), previously cleaved with EcoRI and BamHI. 1.5 µg of the DNA and 1 µg of
λEMBL3-DNA (cleaved with EcoRI and BamHI) are ligated for 16 hours at 16°C in 5 µl of 30 mM tris-Cl
pH 7.5, 10 mM MgCl2, 10 mM DTE, and 2.5 mM ATP after the addition of 0.5 U of T4-DNA ligase (DNA
concentration 500 µg/ml). The ligation mixture is packaged in vitro with the assistance of protein extracts
("packaging mixes", Amersham). The λ-lysates produced are titrated with E. coli KW251 (Promega Inc.).
Approximately 4.5 x 10^ pfu are obtained.
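
The DNA concentration quoted for the ligation can be cross-checked against the masses and the reaction volume. The following is a minimal sketch in Python. It assumes that both DNA masses are given in micrograms (1.5 of genomic DNA and 1 of vector arms) and that the reaction volume is 5 microlitres; the function name is illustrative only.

```python
def dna_concentration_ug_per_ml(masses_ug, volume_ul):
    """Return the total DNA concentration in micrograms per millilitre."""
    total_ug = sum(masses_ug)
    # 1 ug/ul is the same as 1000 ug/ml.
    return total_ug / volume_ul * 1000.0


# Assumed inputs: 1.5 ug genomic DNA plus 1 ug vector arms in a 5 ul reaction.
ligation_conc = dna_concentration_ug_per_ml([1.5, 1.0], 5.0)
print(f"{ligation_conc:.0f} ug/ml")
```

Under these assumptions the result agrees with the DNA concentration stated for the ligation.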

b) Isolation of λ clones

40,000 recombinant phages from the Tolypocladium niveum gene library are cast with E. coli strain KW251
onto 90 mm TB plates (TB contains 10 g/l of bacto tryptone and 5 g/l of NaCl and 0.7% of agarose; the pH
is adjusted to 7.5 with NaOH). Two blots onto nitrocellulose (Stratagene) are made from each plate
(Maniatis et al., 1982). From the amino acid sequence of the cyclosporin synthetase fragment aa9
(Example 9), an oligonucleotide mixture (96 different oligonucleotides, each 20 nucleotides in length)
with the sequences






EP0 578 616 A2 



5' GCA TCA ATA TTA AAT TGA TC 3'
     G   G   G   G   C   G
         T


may be produced on the basis of the genetic code. 1.5 µg of this oligonucleotide mixture are incubated in
25 µl of 50 mM tris-Cl pH 9.5, 10 mM MgCl2, 5 mM DTE, 5% glycerol with 150 µCi γ-ATP (32P) and 20 U of
polynucleotide kinase (Boehringer) for 30 minutes at 37°C. Over 80% of the radioactivity is incorporated.
Hybridisation is performed at 37°C in 400 ml 6 x SSPE (Maniatis et al., 1982), 5 x Denhardt's solution
(Maniatis et al., 1982), 0.1% SDS, 100 µg/ml denatured herring sperm DNA (Maniatis et al., 1982),
0.1 mM ATP, 1.4 x 10^ cpm/ml 32P-labelled oligonucleotide mixture for 16 hours. The filters are washed
three times for 5 minutes and twice for 30 minutes in 6 x SSC (Maniatis et al., 1982) at 4°C. The filters
are then washed for 10 minutes at 37°C in a TMAC (tetramethylammonium chloride) washing solution which
is prepared according to Wood et al., 1985. Finally, the filters are washed for 30 minutes at 57°C in the
TMAC washing solution, dried and exposed for 10 days with a Kodak Xomatic AR X-ray film. Regions of the
agarose layer corresponding to positive signals on the X-ray film are punched out and resuspended in SM
buffer (5.8 g/l NaCl, 2 g/l MgSO4 x 7 H2O and 50 mM tris-Cl pH 7.5). A suitable dilution is again cast
with KW251 onto a TB plate. The plaques are again transferred onto nitrocellulose. The DNA is isolated
from plaques producing a positive hybridisation signal in the second hybridisation. The purified DNA from
these phages is used for Southern hybridisations and restriction analyses. Figure 1 shows the restriction
map of the Tolypocladium niveum portion of such a λ clone (= λSYN3). Subcloning is performed in various
plasmid vectors (for example pUC18, Pharmacia).
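
The figure of 96 different 20-mers quoted for the oligonucleotide mixture can be reproduced by enumerating a degenerate sequence. The sketch below is illustrative only. It assumes that the alternative bases sit at the third position of each complete codon and that the single three-way choice (A, G or T) belongs to the second codon; the column alignment of the printed sequence is not reliable, so this placement is an assumption. The count itself does not depend on where the three-way choice sits.

```python
from itertools import product

# IUPAC ambiguity codes used here: R = A/G, Y = C/T, D = A/G/T.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "D": "AGT",
}


def expand(degenerate):
    """Return every concrete sequence encoded by a degenerate oligonucleotide."""
    choices = [IUPAC[base] for base in degenerate]
    return ["".join(combo) for combo in product(*choices)]


# Assumed placement of the alternatives (see the note above).
probe = "GCR" "TCD" "ATR" "TTR" "AAY" "TGR" "TC"
oligos = expand(probe)
print(len(probe), len(oligos))
```

Five two-way positions and one three-way position give 2^5 x 3 combinations, which is the number of oligonucleotides stated in the text.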

To isolate λ clones containing the neighbouring DNA fragments ("chromosome walking"), the plaque
hybridisation method described above is repeated a number of times, the marginal restriction fragments
being used in each case as 32P-labelled probes. In order to clone the DNA adjoining the region shown
schematically in figure 1 (λSYN3), fragment S5 is used (figure 1). Hybridisation is then performed at
42°C in 6 x SSPE, 50% formamide, 5 x Denhardt's solution, 0.1% SDS, 100 µg/ml denatured herring sperm
DNA, and 100 µM ATP. Before hybridisation, the 32P-labelled DNA is heated to 100°C for 5 minutes and
cooled in ice. After 16 to 20 hours, the filters are washed: three times for 10 minutes in 2 x SSC,
0.1% SDS and twice for 30 minutes in 0.2 x SSC, 0.1% SDS at 65°C. The dried filters are
autoradiographed. Those areas of the agarose corresponding to positive signals are further processed as
described above.

Example 11: Isolation of cosmid clones containing parts of the cyclosporin synthetase gene

a) Construction of a genomic cosmid gene library from Tolypocladium niveum

Protoplasts are produced as described in Example 17. Approximately 10® protoplasts are carefully lysed
in 2 ml of TE (10 mM tris-HCl, 1 mM EDTA, pH 8.0). 0.1 mg/ml of RNase A are added and incubation is
continued for 20 minutes at 37°C. After the addition of 0.5% SDS and 0.1 mg/ml of proteinase K,
incubation is continued for a further 40 minutes at 55°C. The batch is very carefully extracted twice
with each of TE-saturated phenol, phenol/chloroform (1:1) and chloroform/isoamyl alcohol (24:1)
(Maniatis et al., 1982). The aqueous, slightly viscous supernatant is combined with one tenth its volume
of 3 M sodium acetate (pH 5.2) and covered with a layer of 2.5 times its volume of absolute ethanol at
-20°C, and the DNA, found as fine threads at the phase interface, is wound up using glass rods. The DNA
is dissolved in 3 ml of TE for at least 20 hours. Depending on the quality of the protoplasts,
approximately 500 µg/ml of DNA are obtained. Analysis with field inversion gel electrophoresis (FIGE)
(0.8% agarose, 0.5 x TBE (Maniatis et al., 1982), 6 V/cm, forwards pulse 0.2 to 3 sec, pulse ratio 3.0,
running time 5 hours) gives a size greater than 150 kb. Two batches of 135 µg of DNA are cleaved with
7.5 and 15 units respectively of restriction enzyme NdeII (from Boehringer Mannheim) for 1 hour at 37°C
in 1 ml of buffer (tris-acetate 33 mM, magnesium acetate 10 mM, potassium acetate 66 mM, DTT 0.5 mM,
pH 7.9). Aliquots of the cleaved DNA are tested with FIGE and give a maximum size for the fragments
obtained of approximately 45 and 30 kb respectively.

Using a gradient mixer, linear NaCl density gradients from 30% to 5% in 3 mM EDTA pH 8.0 are produced
in ultracentrifuge microtubes and the DNA fragments applied. After centrifugation for 5 hours at
37,000 rpm and 25°C (Beckman L7-65 ultracentrifuge, rotor SW 41), the gradient is harvested in 500 µl
fractions. Fractions with DNA greater than 30 kb and less than 50 kb are dialysed three times for two
hours against TE (tris-HCl 10 mM, EDTA 1 mM, pH 8.0), precipitated with ethanol and each dissolved in
50 µl TE.

sCos1 (from Stratagene) is used as the cloning vector. The vector arms cleaved with BamHI and XbaI are
produced and modified as stated by Evans et al. (1989). 1 µg of the cleaved vector is ligated with
approximately 500 ng of the DNA fragments in 20 µl of ligation mix (tris-HCl 66 mM, MgCl2 5 mM, DTE 1 mM,
ATP 1 mM, pH 7.5) with 16 units of T4-DNA ligase (from Boehringer) for 16 hours at 12°C. 4 µl portions
of the batch are packaged into lambda phage heads with packaging extracts (Gigapak, from Stratagene).
E. coli SRB (from Stratagene) is used as the host strain for the infection and the bacteriophage
lambda-competent cells are produced following the method of Sambrook et al. (1989). After infection, the
batches are plated in aliquots onto LB medium (Maniatis et al., 1982) with 75 µg/ml of ampicillin.
Recombinant clones are discernible as colonies after 20 hours at 37°C. In total, approximately 50,000
colonies are obtained, which are then suspended in 0.9% NaCl/20% glycerol and stored at -70°C. Analysis
of 40 randomly selected clones by isolation and restriction of the cosmids obtained shows that all the
clones contain recombinant cosmids; the average insert size is 36 kb.
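
The library figures above (approximately 50,000 colonies, average insert 36 kb) determine how much DNA has been cloned in total. The sketch below works this out and, as an illustration only, applies the Clarke-Carbon relation P = 1 - (1 - f)^N to estimate the chance that a given sequence is represented. The genome size used is a hypothetical placeholder, since the text does not state one, and the function names are illustrative.

```python
def total_cloned_kb(n_clones, mean_insert_kb):
    """Total amount of insert DNA in the library, in kb."""
    return n_clones * mean_insert_kb


def probability_represented(n_clones, mean_insert_kb, genome_kb):
    """Clarke-Carbon estimate that a given sequence is in the library."""
    fraction = mean_insert_kb / genome_kb  # share of the genome per clone
    return 1.0 - (1.0 - fraction) ** n_clones


cloned_kb = total_cloned_kb(50_000, 36)
print(cloned_kb, "kb cloned in total")

# HYPOTHETICAL genome size: not given in the text, chosen only for illustration.
assumed_genome_kb = 40_000
print(cloned_kb / assumed_genome_kb, "genome equivalents (under the assumed size)")
print(probability_represented(50_000, 36, assumed_genome_kb))
```

Only the total of cloned DNA follows directly from the text; the genome equivalents and the probability change with whatever genome size is assumed.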

b) Isolation of cosmid clones 

The cosmid gene library is plated at a density of approximately 2500 colonies per 85 mm plate on LB
medium with 75 µg/ml of ampicillin (Maniatis et al., 1982). Transfer of each onto two nylon membranes
(Duralon UV, Stratagene) is performed as described in Sambrook et al. (1989). The 1.6 kb HindIII
fragment from λsyn3 (see figure 1) is labelled with alpha-32P-dATP using "Random Priming" (from
Stratagene) and is used as a hybridisation probe. Prehybridisation is performed for 6 hours,
hybridisation for 18 hours at 42°C in 5 x SSC, 40% formamide, 5 x Denhardt's (Maniatis et al., 1982),
0.1% SDS, 25 mM NaH2PO4, pH 6.5, and 250 µg/ml of herring sperm DNA. The filters are washed twice for
10 minutes in 2 x SSC/0.1% SDS at room temperature and twice for 40 minutes in 1 x SSC/0.1% SDS at 60°C.
The membranes are exposed for 14 hours on X-ray film (Kodak Xomatic AR). Colonies having positive
signals are purified, and the corresponding cosmid DNA is isolated from the colonies and characterised
by various restriction analyses and hybridisations with the labelled λsyn3 probes and the vector DNA
sCos1. Figure 5 shows the restriction map of the cloned regions of such a cosmid, syncos13; the
Tolypocladium niveum DNA contained in it amounts to approximately 35 kb and also includes the region of
λsyn3.

Example 12: Isolation of a P1 clone with the complete gene for cyclosporin synthetase

Protoplasts are produced from Tolypocladium niveum as described in Example 17 and suspended at a density
of 10^/ml in TPS. 1 ml portions of this suspension are mixed with 1 ml of 1.6% melted agarose (Incert
from FMC) held at 40°C and cast into small 1.5 mm thick blocks using a casting stand (BioRad). After
solidifying, the blocks are transferred into lysis buffer (0.45 M EDTA pH 8.0, 1% N-lauroyl sarcosine,
1 mg/ml proteinase K) and incubated for 16 hours at 55°C. The blocks are washed three times for 2 hours
in 0.5 M EDTA pH 8.0 while being slowly rocked and are then stored at 4°C. Before being cleaved, the
blocks are cut into small strips, transferred into Eppendorf microtubes and washed four times for
2 hours and once for 16 hours in TE. The blocks are preincubated in four parallel batches at 4°C, each
in 300 µl BamHI buffer (from NEB), supplemented with 100 µg/ml of bovine serum albumin (from NEB) and
80 µM S-adenosylmethionine, for 3 hours on ice. Then, 2 units of BamHI (from NEB) and 16, 20, 24 or
28 units of BamHI methylase (from NEB) are added to each batch and incubation is continued for a further
90 minutes on ice and then for 1 hour at 37°C. The reactions are arrested by the addition of 20 mM of
EDTA and 0.5 mg/ml of proteinase K and incubated at 37°C for 30 minutes.

The blocks are applied to a 1% agarose gel (Seaplaque GTG from FMC) and the DNA fragments separated
by pulsed field gel electrophoresis (Chef DR II from BioRad, 0.5 x TBE (Maniatis et al., 1982), switch
interval of 8-16 sec, 150 V, 16 h, 12°C).

The region of DNA fragments between 70 and 100 kb is cut out of the gel and the agarose hydrolysed
with β-agarase (from NEB). The DNA solution obtained in this manner is very carefully extracted once with
tris-saturated phenol and once with chloroform/isoamyl alcohol (24:1) and then concentrated to a final
volume of approximately 100 µl by extraction with 1-butanol.

pNS528tet14-Ad10-SacIIB (from DuPont-NEN) is used as the cloning vector. The vector arms are prepared
as stated in Pierce et al. (1992). Approximately 250 ng of the cleaved vector are ligated with
approximately 500 ng of the DNA fraction for 16 hours at 16°C (performed as in Example 11, total volume
15 µl). After heating the ligation to 70°C for 10 minutes, 4 µl aliquots are cleaved with pacase (from
DuPont-NEN) and packaged into bacteriophage P1 envelopes by addition of the "head/tail" extract, as
described in Pierce and Sternberg (1991). After infection of E. coli NS3529, the preparation is plated
onto LB medium (Maniatis et al., 1982) with 25 µg/ml kanamycin and 5% saccharose. Recombinant clones
become visible after incubation of the plates at 37°C for 20 h.

In total, approximately 2000 colonies are obtained, which are stored as a pool in 0.9% NaCl/20% glycerol
at -70°C as "P1 library".

The gene library (10 x 500 colonies) is screened as described in Example 11 (cosmid clones). Inter alia,
a positive clone is obtained which contains all the fragments of the cosmid clone syncos13, together
with a further approximately 30 kb of the cyclosporin synthetase gene in the 5' direction.
Hybridisation with oligonucleotide mixtures derived from suitable amino acid sequences (see Example 9
and Example 10) shows that all the tested sequences are present on this P1 clone (synp4). In this way,
it is ensured that the complete gene for cyclosporin synthetase is contained on this clone synp4.

Example 13: DNA partial sequence of the cyclosporin synthetase gene from Tolypocladium niveum
ATCC 34921

a) The DNA cloned as described in Examples 11 and 12 is sequenced and is illustrated as Seq Id 1.

b) A polypeptide with the amino acid sequence illustrated as Seq Id 2 may be derived from this DNA.

Example 14: Comparison of the amino acid sequences derived from the DNA with the cyclosporin
synthetase amino acid partial sequences

The DNA of Seq Id 1 is translated on the basis of the genetic code into an amino acid sequence (i.e.
position 1 of the protein sequence corresponds to position 885 of the DNA sequence) and is compared with
the amino acid sequences given in Example 9:

AA-Partial sequence 3: in Seq Id 2, position 12254 is T. Otherwise all amino acids correspond.
AA-Partial sequence 4: all amino acids correspond.
AA-Partial sequence 5: all amino acids correspond.
AA-Partial sequence 9: in Seq Id 2, position 13730 is W. Otherwise all amino acids correspond.
(Position 13 of the AA partial sequence aa9 could not be determined.)
AA-Partial sequence 10: all amino acids correspond.
AA-Partial sequence 12: all amino acids correspond.
AA-Partial sequence 13: all amino acids correspond.
AA-Partial sequence 14: in Seq Id 2, position 9565 is C. Otherwise all amino acids correspond.
AA-Partial sequence 15: all amino acids correspond.
AA-Partial sequence 16: Position 1 of the AA partial sequence aa16 does not correspond to the AA
sequence of Seq Id 2. Otherwise all amino acids correspond.
AA-Partial sequence 19: in Seq Id 2, positions 9082 and 9083 are R and Y. Otherwise all amino acids
correspond.
AA-Partial sequence 20: in Seq Id 2, position 6545 is W. Otherwise all amino acids correspond.
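
The correspondence between protein positions and DNA positions used in this comparison can be sketched as follows. The snippet assumes that the reading frame runs without interruption from position 885 of Seq Id 1, so that each amino acid occupies three consecutive nucleotides; this is an assumption made for illustration, and the function name is hypothetical.

```python
ORF_START = 885  # position in Seq Id 1 corresponding to protein position 1


def codon_span(aa_position, orf_start=ORF_START):
    """Return the 1-based (first, last) nucleotide positions of a codon."""
    if aa_position < 1:
        raise ValueError("amino acid positions are 1-based")
    first = orf_start + 3 * (aa_position - 1)
    return first, first + 2


# Protein positions mentioned in the comparison above.
for position in (1, 6545, 9565, 12254, 13730):
    print(position, codon_span(position))
```

A lookup of this kind makes it easy to go from a deviating amino acid position back to the three nucleotides that would need to be re-examined in the sequencing data.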

Further, internal comparison of the amino acids 13804-14063 of Seq Id 2 with amino acids 12304-12563
of Seq Id 2 shows that 178 out of 259 amino acids are identical (68.7%). A further 28 amino acid
residues (10.8%) are functionally similar. In total, 11 partial regions similar to each other may be
identified in this manner.
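
The percentages quoted for the internal comparison follow directly from the counts. The sketch below reproduces them and shows, on made-up toy strings, how identical positions in two pre-aligned segments could be counted. The toy sequences are hypothetical and are not taken from Seq Id 2.

```python
def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


def count_identical(seq_a, seq_b):
    """Count identical positions in two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(a == b for a, b in zip(seq_a, seq_b))


identity = percent(178, 259)    # identical residues
similarity = percent(28, 259)   # functionally similar residues
print(identity, similarity)

# Toy example with hypothetical four-residue segments.
print(count_identical("MKTA", "MRTA"))
```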

Example 15: Isolation of RNA from mycelium of Tolypocladium niveum and Northern hybridisation

A 1 l conical flask with 100 ml of medium 4 (Dreyfuss et al., 1976) is inoculated with a spore
suspension of Tolypocladium niveum ATCC 34921 (1 x 10^ spores/ml) and shaken for 96 hours at 250 rpm and
25°C. 1 l conical flasks with 100 ml of medium 5 (Dreyfuss et al., 1976) are inoculated with 10 ml of
this preculture and shaken for 7 days at 25°C and 250 rpm. The cyclosporin A concentration is determined
(Dreyfuss et al., 1976) to be 100 µg/ml. 8 g of moist mycelial mass is filtered, washed with TE (10 mM
tris-Cl pH 7.5, 1 mM EDTA) and ground to a fine powder in a mortar under liquid nitrogen. RNA is then
isolated according to the method described by Cathala et al. (1983). 4 mg of RNA are obtained, which are
stored at -20°C. 10 µg of the RNA are separated on a denaturing 1.2% agarose gel containing 0.6 M
formaldehyde. The electrophoresis buffer is 0.2 M MOPS, 50 mM sodium acetate, 10 mM EDTA, pH 7.0. The
RNA is dissolved in a buffer mixed together from 0.72 ml formamide, 0.16 ml of 10 x concentrated
electrophoresis buffer, 0.26 ml formaldehyde, 0.18 ml water and 0.10 ml glycerol. The samples are heated
to 100°C for 2 minutes and separated at 115 V, 100 mA over 2 hours. The gel is shaken three times for
20 minutes in 10 x SSC, blotted onto Hybond N filter and fixed by UV treatment. Hybridisation is
performed at 42°C in 6 x SSPE, 50% formamide, 5 x Denhardt's solution, 0.1% SDS, 100 µg/ml denatured
herring sperm DNA, and 100 µM ATP. The 32P-labelled DNA (fragments of the cloned DNAs described in
Examples 9 to 12) is heated to 100°C for 5 minutes and cooled in ice before hybridisation. After 16 to
20 hours, the filters are washed: three times for 10 minutes in 2 x SSC, 0.1% SDS and twice for
30 minutes in 0.2 x SSC, 0.1% SDS at 65°C. The dried filters are autoradiographed. If the fragment used
as the probe is a fragment of the cyclosporin synthetase gene, a band may be detected on the X-ray film
after 24 to 72 hours of autoradiography at -70°C. The band exhibits distinctly less mobility than the
largest of the comparison RNAs used (9500 b; RNA ladder, BRL). Figure 1 summarises the results of such
hybridisations: in relation to the restriction map of a λ clone, the isolation of which is described in
Example 10, the positions of individual restriction fragments are given which were used as probes in
Northern hybridisations. The filled-in rectangles indicate that the bands described above may be
detected (E2, E3, E1, S3, S5), while the rectangles with the transverse lines stand for those fragments
which do not hybridise with such a band (E4, S2). (Fragment S4 was not used as a probe.)

Example 16: Identification of homologous synthetase genes

100 ml of medium 1 (Dreyfuss et al., 1976) are inoculated with 1 x 10^ fungal spores and shaken for
72 hours at 25°C and 250 rpm. The mycelium is filtered out, washed with TE and lyophilised. 100 mg of
lyophilised mycelium are added to 700 µl of lysis buffer (200 mM tris-Cl pH 8.5, 250 mM NaCl, 25 mM
EDTA, 0.5% SDS) and 100 mg of aluminium oxide powder (Sigma A2039) in an Eppendorf homogeniser and are
homogenised. 500 µl of phenol-chloroform are then added and vigorously mixed in. After 15 minutes of
centrifugation, the extraction is repeated. A volume of 3 M sodium acetate pH 5.2 corresponding to
0.1 times the volume of the supernatant is added to the supernatant, and then a volume of i-propanol
corresponding to 0.6 times the volume of the supernatant is thoroughly mixed in. After 5 minutes of
centrifugation, the pellet is washed with 70% ethanol, briefly dried and dissolved in 100 µl of TE with
100 µg/ml of RNase and incubated for 15 minutes at 37°C. The phenol-chloroform extraction and ethanol
precipitation are then repeated. The precipitated DNA is collected.

5 µl portions of the DNA are cleaved with XhoI, separated on an agarose gel and blotted onto a nylon
filter. These filters are hybridised with 32P-labelled λSYN3 DNA as a probe. Hybridisation is performed
under standard conditions, as described in Example 10 ("chromosome walking"). The hybridisations may,
however, also be performed under less stringent conditions.

The following hybridising bands are obtained with DNA from Tolypocladium niveum (all data are estimates
due to mobility in the gel): 3.6 kb, 3.4 kb, 3.2 kb, 3.0 kb, 2.3 kb, 1.9 kb and 0.7 kb. DNA from
Fusarium solani ATCC 46829 also displays bands at 3.6 kb, 3.4 kb, 1.9 kb and 0.7 kb together with a
further band at approximately 2.1 kb. DNA from Neocosmospora vasinfecta ATCC 24402 also displays the
bands at 3.6 kb, 3.4 kb, 1.9 kb and 0.7 kb, together with two further bands at 2.9 kb and 1.8 kb. DNA
from Tolypocladium geodes, Acremonium sp. S42160/F, Paecilomyces sp. SB4-21622/F, Verticillium sp.
85-22022/F (Dreyfuss, 1986) each display several hybridising bands in the range 0.7 kb to 7 kb.
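
The band patterns listed above can be compared systematically by treating each pattern as a set of fragment sizes. The following sketch is illustrative only: it uses the sizes exactly as quoted and ignores the estimation error that the text mentions, and the function name is hypothetical.

```python
# Hybridising XhoI bands in kb, as listed in the text.
bands_kb = {
    "Tolypocladium niveum": {3.6, 3.4, 3.2, 3.0, 2.3, 1.9, 0.7},
    "Fusarium solani ATCC 46829": {3.6, 3.4, 1.9, 0.7, 2.1},
    "Neocosmospora vasinfecta ATCC 24402": {3.6, 3.4, 1.9, 0.7, 2.9, 1.8},
}


def compare_bands(bands, reference):
    """Split a band pattern into bands shared with the reference and extra bands."""
    shared = sorted(bands & reference, reverse=True)
    extra = sorted(bands - reference, reverse=True)
    return shared, extra


reference = bands_kb["Tolypocladium niveum"]
for species, bands in bands_kb.items():
    shared, extra = compare_bands(bands, reference)
    print(f"{species}: shared {shared}, additional {extra}")
```

With real gel data the sizes would need a tolerance rather than exact matching, since all values are estimates from mobility.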

On the basis of the DNA sequence Seq Id 1, the following oligonucleotide pairs may be synthesised:

Nucleotides 35073-35092 of Seq Id 1
Nucleotides 37848-37829 of Seq Id 1 (complementary strand)

or also

Nucleotides 40309-40328 of Seq Id 1
Nucleotides 42018-41999 of Seq Id 1 (complementary strand)

If 50 ng of the Tolypocladium geodes CBS 723.70 DNA is amplified with the first of the two
oligonucleotide pairs described above (Sambrook et al., 1989; 30 cycles: 1 min 30 sec at 94°C; 2 min
30 sec at 50°C; 6 minutes at 72°C), a 350 bp DNA is produced. If a part of this DNA is sequenced, the
sequence given as Seq Id 3 is obtained. This DNA sequence is 75.1% homologous to the corresponding DNA
sequence of Seq Id 1.

Also, if 50 ng of the Neocosmospora vasinfecta ATCC 24402 DNA is amplified with the second of the two
oligonucleotide pairs described above (Sambrook et al., 1989; 30 cycles: 1 min 30 sec at 94°C; 2 min
30 sec at 50°C; 6 minutes at 72°C), a 1713 bp DNA is produced. If this DNA is sequenced, the sequence
given as Seq Id 4 is obtained. This DNA sequence is 96.3% homologous to the corresponding DNA sequence
of Seq Id 1.
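
The primer coordinates given above fix the length of each primer and the span that each pair brackets on Seq Id 1 itself. The sketch below computes both, and also shows a generic reverse-complement helper of the kind needed to write out a primer from the complementary strand. The helper names and the toy sequence are illustrative assumptions. Products amplified from the DNA of other species need not have the same length as the span on Seq Id 1.

```python
def primer_length(first, last):
    """Length of a primer from its two end coordinates (either orientation)."""
    return abs(last - first) + 1


def bracketed_span(forward_5prime, reverse_5prime):
    """Length on the template between the two primer 5' ends, inclusive."""
    if reverse_5prime < forward_5prime:
        raise ValueError("reverse primer must lie downstream of the forward primer")
    return reverse_5prime - forward_5prime + 1


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# 5' ends of the forward and complementary-strand primers, from the text.
pairs = {"first pair": (35073, 37848), "second pair": (40309, 42018)}
for name, (fwd, rev) in pairs.items():
    print(name, bracketed_span(fwd, rev), "bp on Seq Id 1")

print(primer_length(35073, 35092), primer_length(37848, 37829))
print(reverse_complement("ATGC"))  # toy sequence, not from Seq Id 1
```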

Example 17: Protoplastisation and transformation of Tolypocladium niveum

a) Method 1:

200 ml of medium 1 (maltose (monohydrate) 50 g/l, casein peptone, digested with trypsin (Fluka 70169)
10 g/l, KH2PO4 5 g/l, KCl 2.5 g/l, pH 5.6) in a conical flask are inoculated with 10^ spores of
Tolypocladium niveum and are incubated at 27°C, 250 rpm for approximately 70 hours. 200 µl of (0.1%)
β-mercaptoethanol are added and incubation continued for a further 16 hours. The mycelium is harvested
by centrifugation (Beckman J2-21 centrifuge, rotor JA14, 8000 rpm, 20°C, 5 minutes), washed in 40 ml of
TPS (NaCl 0.6 M, KH2PO4/NaH2PO4 66 mM pH 6.2) and the pellet volume measured by centrifugation in
calibrated microtubes at 2000 g (in Beckman GPR centrifuge, GH3.7 rotor, 3000 rpm, 5 minutes). The
mycelium is suspended in TPS (3 ml of TPS are used for each 1 ml of pellet volume) and the same volume
of protoplastisation solution is added (Novozym 234 10 mg/ml (from Novo Industrie, batch PPM-2415),
cytohelicase 5 mg/ml (from IBF), Zymolyase 20T 1 mg/ml (from Seikagaku Kogyo, batch no. 120491)). The
suspension is incubated at 27°C at 80 rpm for approximately 60 minutes. The protoplasts are filtered
through a milk filter, centrifuged out (700 g, 10 minutes) and taken up in a total of 4 ml of TPS. Each
1 ml of this suspension is layered on to 4 ml of 35% saccharose solution and is centrifuged at 600 g,
20°C for 20 minutes. The protoplast bands at the phase interface are drawn off, each diluted to 10 ml
with TPS, centrifuged out, carefully resuspended in 200 µl portions of TPS and the suspensions are
combined. For each 1 ml of pellet volume of starting mycelium (see above), approximately 2 x 10^
protoplasts are obtained.

The protoplast suspension is centrifuged out (700 g, 10 minutes) and suspended in 1 M sorbitol, 50 mM
CaCl2 at a density of 1 x 10^. 90 µl portions of this suspension are combined with 10 µl of the vector
DNA to be transformed, which contains the amdS gene from Aspergillus nidulans, for example plasmid
p3SR2 (Hynes et al., 1983) (1-10 µg dissolved in tris-HCl 10 mM, EDTA 1 mM, pH 8.0), and 25 µl of
PEG 6000 solution are added (25% PEG 6000, 50 mM CaCl2, 10 mM tris-HCl, pH 7.5, freshly prepared from
the stock solutions: 60% PEG 6000 (from BDH), 250 mM tris-HCl pH 7.5, 250 mM CaCl2). The transformation
batch is placed on ice for 20 minutes and then a further 500 µl of the mixed PEG 6000 solution are added
and carefully mixed in. After 5 minutes at room temperature, 1 ml of 0.9 M NaCl, 50 mM CaCl2 is added,
and the entire batch is added to 7 ml of melted soft agar TMMAAC+N, held at 45°C, and cast onto
preheated TMMAAC+N plates. Medium TMMAAC+N contains 6 g/l glucose, 3 g/l KH2PO4, 0.5 g/l KCl, 0.4 g/l
MgSO4 x 7 H2O, 0.2 g/l CaCl2 x 2 H2O, 8 mM acrylamide, 2.1 g/l CsCl, 1 ml/l trace element solution, and
0.6 M NaCl. 15 g/l of Agar-Agar (Merck) are used for plates and 7 g/l for soft agar. The trace element
solution contains 1 mg/ml of FeSO4 x 7 H2O, 9 mg/ml of ZnSO4 x 7 H2O, 0.4 mg/ml of CuSO4 x 5 H2O,
0.1 mg/ml of MnSO4 x H2O, 0.1 mg/ml of H3BO3 and 0.1 mg/ml of Na2MoO4 x H2O. Transformants are capable
of using acrylamide as a source of nitrogen in the medium and may therefore be identified after
approximately 3 weeks at 25°C as colonies against weak background growth.
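
The PEG 6000 solution used in Method 1 is mixed freshly from three stock solutions. The volumes follow from the usual relation C1 x V1 = C2 x V2. The sketch below works them out for a batch of 1 ml; the batch size, the use of water to make up the volume, and the function name are assumptions made for illustration and are not stated in the text.

```python
def stock_volumes(final_volume_ml, targets, stocks):
    """Volume of each stock needed (C1*V1 = C2*V2); the remainder is water."""
    volumes = {
        name: final_volume_ml * targets[name] / stocks[name] for name in targets
    }
    water = final_volume_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested final volume")
    volumes["water"] = water
    return volumes


# Final and stock concentrations as given in the text (PEG in %, salts in mM).
targets = {"PEG 6000": 25, "CaCl2": 50, "tris-HCl pH 7.5": 10}
stocks = {"PEG 6000": 60, "CaCl2": 250, "tris-HCl pH 7.5": 250}

recipe = stock_volumes(1.0, targets, stocks)  # assumed 1 ml batch
for component, volume in recipe.items():
    print(f"{component}: {volume * 1000:.0f} ul")
```

The same helper applies to the PEG 4000 solution of Method 2 if its stock and final concentrations are substituted.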

b) Method 2: 

Two portions each of 4.0 ml of the Tolypocladium niveum spores (ATCC 34921; 5 x 10®/ml) are introduced
into a 1 l conical flask with 200 ml of medium 1 (50 g/l maltose (monohydrate), 10 g/l casein peptone,
digested with trypsin, Fluka 70169, 5 g/l KH2PO4, 2.5 g/l KCl, pH 5.6) and are shaken at 25°C at
250 rpm for 65 hours. The mycelium is filtered out over a sterile sintered porcelain filter with GMX
nylon gauze, washed with TE (10 mM tris-Cl pH 7.5, 1 mM EDTA) and resuspended in 40 ml of YG (5 g/l
yeast extract, 20 g/l dextrose). Centrifugation is carried out at 900 g and 20°C for 5 minutes. The
pellet is resuspended in YG (approximately 1 ml pellet in 5 ml) and 5 ml of protoplastisation solution
are added to 5 ml of suspension. The protoplastisation solution is produced from a solution containing
1.1 M KCl and 0.1 M citric acid. The pH is adjusted to 5.8 with KOH. Driselase (Sigma D9515) is added
(15 mg/ml; storage at -20°C); the suspension is kept on ice for 15 minutes and the starch carrier is
removed by centrifugation for 5 minutes at 2000 rpm. Novozym (4 mg/ml) and bovine serum albumin (Sigma
A7096, 20 mg/ml) are added. The solution is filtered through Millipore SLGV025LS and is kept on ice
until used. The preparation is shaken at 37°C for 2.5 hours at 250 rpm. The preparation is filtered
through a milk filter. The protoplasts are centrifuged out (700 g; 20°C; 5 minutes) and carefully
resuspended in STC (1.2 M sorbitol, 50 mM CaCl2, 10 mM tris-HCl pH 7.5). 5 ml of 35% saccharose solution
are carefully covered with a layer of the suspension and centrifuged (600 g; 20°C; 20 minutes). The
bands are drawn off and diluted to approximately 5 ml with STC. 2 x 10® protoplasts are obtained from
200 ml of culture.

50 µl of the protoplast suspension (1 x 10®/ml) are introduced into a sterile Eppendorf tube and 5 µg of
plasmid DNA in TE and 12.5 µl of PEG solution (20% PEG 4000, 50 mM CaCl2, 10 mM tris-HCl pH 7.5) are
added. This solution is mixed from separately autoclaved stock solutions: 1 M CaCl2, 1 M tris-HCl
pH 7.5, 60% PEG 4000 (Riedel de Haen). Once the mixture has stood for 20 minutes in ice, 0.5 ml of PEG
solution is added and carefully mixed in. After 5 minutes at room temperature, 1 ml of 0.9 M NaCl,
50 mM CaCl2 is carefully mixed in. The suspension is added to 10 ml of TM88 sorbitol soft agar (20 g/l
malt extract, 4 g/l yeast extract, 10 g/l bacto agar, 218 g/l sorbitol, pH 5.7) (45°C) and cast onto
TM88 sorbitol plates (10 ml TM88 sorbitol agar: 20 g/l malt extract, 4 g/l yeast extract, 30 g/l bacto
agar, 218 g/l sorbitol, pH 5.7). After 15 to 20 hours at 25°C, 10 ml of TM88 sorbitol agar with
600 µg/ml of hygromycin (45°C) are poured over. Hygromycin resistant transformants may be detected after
7 days at 25°C.

Example 18: Construction of vectors pSIM10, pSIM11 and pSIM12 and transformation with these plasmids

a) Isolation of cyclophilin gene from Tolypocladium niveum

As described in Example 10, the Tolypocladium niveum gene library is screened with a radioactively
labelled DNA probe. Hybridisation is performed at 42°C in 6 x SSPE, 30% formamide, 5 x Denhardt's
solution, 0.1% SDS, 100 µg/ml denatured herring sperm DNA, and 100 µM ATP. 32P-labelled DNA (fragments
of the DNA of the cyclophilin gene from Neurospora crassa, Tropschug et al., 1988) is heated to 100°C
for 5 minutes and cooled in ice before hybridisation. After 16 to 20 hours, the filters are washed three
times for 10 minutes in 2 x SSC, 0.1% SDS and twice for 30 minutes in 1 x SSC, 0.1% SDS at 45°C. The
dried filters are autoradiographed. The purified DNA from λ-phages is subcloned in plasmids and
characterised by restriction mapping, Southern hybridisation and DNA sequencing. The cDNA sequence of
Seq Id 5 is obtained. The sequence is homologous to the cyclophilin gene of N. crassa. The start codon
ATG is at positions 12-14 and the stop codon TAA is at positions 552-554.
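
The positions of the start and stop codons determine the length of the reading frame in Seq Id 5. The following sketch derives it, assuming that the cDNA reading frame is continuous between the two codons; the function name is illustrative.

```python
def orf_summary(start_first, stop_last):
    """Return (nucleotides, codons incl. stop, encoded residues) for an ORF.

    start_first: position of the first base of the start codon
    stop_last:   position of the last base of the stop codon
    """
    length_nt = stop_last - start_first + 1
    if length_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    codons = length_nt // 3
    return length_nt, codons, codons - 1  # the stop codon encodes no residue


# ATG at positions 12-14, TAA at positions 552-554 (from the text).
print(orf_summary(12, 554))
```

The check that the length is divisible by three is a quick way to confirm that the quoted start and stop positions are in frame with each other.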

b) Construction of vector pSIM10 and transformation with this plasmid

On the basis of Seq Id 5, a first oligonucleotide is synthesised which is largely complementary to Seq
Id 5 (positions 2 to 29); however, the ATG region (12 to 14) is altered in such a way that a ClaI
cleavage point (ATCGAT) is produced. A second oligonucleotide contains a sequence of the plasmid pUC18
and a recognition sequence for BamHI and is given as Seq Id 6.

A plasmid containing a 2.7 kb EcoRI-HindIII fragment from Example 18a cloned into pUC18 is linearised
with HindIII. 1 ng of the plasmid DNA is amplified with the oligonucleotides described above (Sambrook
et al., 1989; 30 cycles: 1 min 30 sec at 94°C; 2 min 30 sec at 50°C; 6 minutes at 72°C). A 2.1 kb DNA is
produced. After chloroform extraction, this DNA is purified by ultrafiltration (Ultrafree MC 100 000;
Millipore) and cleaved in the appropriate buffer with the enzymes ClaI and BamHI. 50 ng of this DNA are
ligated with 50 ng of BamHI and ClaI cleaved DNA of the plasmid pGEM7Zf (Promega). The newly produced
plasmid is cleaved with ClaI and XbaI and ligated with a ClaI-XbaI restriction fragment 1.76 kb in size
from the plasmid pCSN44 (Staben et al., 1989). A restriction map of this plasmid (pSIM10) is reproduced
in figure 3.

The 2157 bp BamHI-ClaI restriction fragment of the plasmid (4714-6865 in figure 3), which contains the
cyclophilin gene promoter, has the DNA sequence of Seq Id 7.

The plasmid pSIM10 may be used for the transformation of Tolypocladium niveum, as described in
Example 17. DNA from the transformants is cleaved with BamHI and, after electrophoresis, blotted on a
nylon membrane. The 1.8 kb BglII fragment from pSIM10 (figure 3) is used as a radioactive probe. In this
way, those of the transformants in which the plasmid pSIM10 has been incorporated once or a plurality of
times into the genome may be identified.

The XhoI cleavage point in plasmid pSIM10 (4924) allows the construction of plasmids which contain
defined parts of the cyclosporin synthetase gene with which a deliberate inactivation of the
cyclosporin synthetase gene is possible:

pSIM11 contains a 3.6 kb XhoI restriction fragment (42285-45909 of Seq Id 1). If the plasmid linearised
with EcoRV is used for the transformation, approximately 30% of transformants obtained no longer form
cyclosporin. It is shown with Southern hybridisations with DNA from such transformants that an 8.4 kb
XbaI fragment is no longer detectable, but instead two new restriction fragments with 10.6 kb and
8.2 kb are detected.

pSIM12 contains a 0.8 kb XhoI restriction fragment (39663-40461 of Seq Id 1). If the plasmid linearised
with SalI is used for the transformation, approximately 30% of transformants obtained no longer form
cyclosporin. It is shown with Southern hybridisations with DNA from such transformants that an 8.4 kb
XbaI fragment is no longer detectable, but instead two new restriction fragments with 10.4 kb and
5.6 kb are detected.
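
The fragment sizes and Southern results quoted for pSIM11 and pSIM12 can be checked with simple arithmetic. The sketch below computes the length of each XhoI fragment from its coordinates on Seq Id 1 and the net amount of DNA gained when the 8.4 kb XbaI band is replaced by two new bands. Reading that gain as roughly one integrated copy of the linearised plasmid is an interpretation offered here for illustration and is not stated in the text; the function names are hypothetical.

```python
def span_bp(first, last):
    """Length in bp of a fragment given its inclusive end coordinates."""
    return last - first + 1


def net_gain_kb(new_bands_kb, lost_band_kb):
    """DNA gained when one band is replaced by several new bands."""
    return round(sum(new_bands_kb) - lost_band_kb, 1)


# XhoI fragments by coordinates on Seq Id 1 (from the text).
fragments = {"pSIM11": (42285, 45909), "pSIM12": (39663, 40461)}
for plasmid, (first, last) in fragments.items():
    print(plasmid, span_bp(first, last), "bp")

# XbaI bands in Southern hybridisations of non-producing transformants.
print("pSIM11 transformants:", net_gain_kb([10.6, 8.2], 8.4), "kb gained")
print("pSIM12 transformants:", net_gain_kb([10.4, 5.6], 8.4), "kb gained")
```

The coordinate spans agree with the rounded sizes of 3.6 kb and 0.8 kb given for the two fragments.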

Example 19: Cotransformation with synp4 

pSIM10 (Example 18) is used as transformation vector. Together with this vector, equimolar quantities
of synp4 (Example 12) are also used in the same transformation batch. These cotransformations are
performed according to the method described in Example 17 and Tolypocladium niveum ATCC 34921 is used
as the starting strain.

Genomic DNA from hygromycin resistant transformants is isolated according to a rapid method. To this
end, mycelium is taken from an area of approximately 1 cm^2 of the corresponding colony and transferred
into Eppendorf homogenisers. 1 ml lysis buffer (50 mM EDTA, 0.2% SDS) and 100 mg aluminium oxide (grade
A5, from Sigma) are added and thoroughly homogenised for approximately 5 minutes. After centrifugation
(5 minutes, 11,000 rpm) the supernatant is extracted once with each of tris-saturated phenol,
phenol/chloroform (1:1) and chloroform/isoamyl alcohol (24:1) and the DNA precipitated with isopropanol
using the standard procedure (Sambrook et al., 1989).

The DNA is completely restricted with the restriction enzyme SalI, separated with gel electrophoresis and investigated in Southern hybridisations. The 0.8% agarose gel is transferred by vacuum blotting (Vacu blot, from Pharmacia) onto a nylon membrane (Duralon-UV from Stratagene) and fixed with UV.

As probe for the hybridisations, the small SpeI restriction fragment from the bacteriophage P1 vector pNS528tet14-Ad10-SacIIB (from DuPont-NEN) is prepared by gel electrophoresis and Geneclean II Kit (from BIO101) and radioactively labelled with alpha-32P dATP by "random primer" synthesis (from Stratagene).
Prehybridisation is performed for approximately 8 to 16 hours at 42°C in 6 x SSC, 50% formamide, 5 x Denhardt's (Maniatis et al., 1982), 0.1% SDS, 0.25 mg/ml denatured herring sperm DNA, and 25 mM NaH2PO4 pH 6.5 in a volume of 10 ml per 100 cm² of membrane. After addition of the labelled probe, incubation is continued for a further 16 to 20 hours at 42°C. The blot is washed twice for 10 minutes with 2 x SSC/0.1% SDS at 25°C and twice for 30 minutes with 0.5 x SSC/0.1% SDS at 60°C. After autoradiography for approximately 48 to 96 hours at -70°C with Kodak intensifying film onto X-ray film (Xomatic AR, from Kodak), bands become visible on the X-ray film.
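
The SSC strengths and the prehybridisation volume used above follow directly from the 1 x SSC composition given in the abbreviation list of this document (150 mM NaCl, 15 mM sodium citrate) and from the stated 10 ml per 100 cm² of membrane. The following is a minimal sketch of that arithmetic; the function names are hypothetical and chosen only for illustration.

```python
# 1 x SSC as defined in the abbreviation list of this document (mM).
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_composition(strength: float) -> dict:
    """Millimolar composition of an SSC solution at the given strength."""
    return {component: conc * strength for component, conc in SSC_1X_MM.items()}


def prehybridisation_volume_ml(membrane_area_cm2: float) -> float:
    """Volume of prehybridisation solution at 10 ml per 100 cm² of membrane."""
    return membrane_area_cm2 * 10.0 / 100.0


if __name__ == "__main__":
    # Strengths used for prehybridisation and for the two wash steps.
    for strength in (6, 2, 0.5):
        print(strength, "x SSC:", ssc_composition(strength))
    # Example membrane area (an arbitrary value, not taken from the source).
    print(prehybridisation_volume_ml(250.0), "ml")
```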

Some of the investigated DNAs display hybridisation signals which are attributable to the integration of synp4. The number of signals, which should correlate with the number of integrated synp4 molecules, varies between 1 and 3.

A transformant strain verified in this manner is investigated for cyclosporin A formation by test fermentation in a shaking flask as described by Dreyfuss et al. (1976). Whilst approximately 100 µg/ml of cyclosporin A is formed in parallel tests of the untransformed starting strain Tolypocladium niveum ATCC 34921, approximately 150 µg/ml of cyclosporin A is detected in tests with the strain in which additional copies of the cyclosporin synthetase gene are present due to the integration of synp4.
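
The size of the effect reported for the cotransformant can be expressed as a relative increase over the starting strain. This is a small illustrative calculation only; it assumes that both titres are stated in the same unit, and the function name is hypothetical.

```python
def relative_increase_percent(baseline: float, observed: float) -> float:
    """Percentage increase of `observed` over `baseline` (same units assumed)."""
    return (observed - baseline) / baseline * 100.0


if __name__ == "__main__":
    # Approximate cyclosporin A titres quoted in the text.
    print(relative_increase_percent(100.0, 150.0))
```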

Abbreviations used:

ACV         aminoadipyl-cysteinyl-valine
amdS        acetamidase gene
ATCC        American Type Culture Collection
ATP         adenosine triphosphate
bp          base pairs
CBS         Centraalbureau voor Schimmelcultures
DTE         dithioerythritol
DTT         dithiothreitol
EDTA        ethylenediaminetetraacetic acid
HEPES       N-2-hydroxyethyl-piperazine-N-2-propanesulphonic acid
MOPS        3-morpholinepropanesulphonic acid
PEG         polyethylene glycol
pfu         plaque forming units
SDS         sodium dodecyl sulphate
SDS-PAGE    SDS-polyacrylamide gel electrophoresis
SSC         150 mM NaCl, 15 mM sodium citrate, pH 7.0
SSPE        180 mM NaCl, 10 mM sodium phosphate, 1 mM EDTA, pH 7.7
TE          10 mM tris-Cl pH 7.5, 1 mM EDTA
TFA         trifluoroacetic acid
tris        tris(hydroxymethyl)aminomethane
YAC         yeast artificial chromosome

Moreover, the customary abbreviations for the restriction endonucleases are used (Sau3A, HindII, EcoRI, HindIII, ClaI etc.; Maniatis et al., 1982). The nucleotide abbreviations A, T, C, G are used for DNA sequences and the amino acid abbreviations (Arg, Asn, Asp, Cys etc.; or R, N, D, C etc.) for polypeptides (Sambrook et al., 1989).
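
The sequence listing in this document is laid out in rows of six ten-base blocks followed by a running position count. A layout of this kind lends itself to a simple consistency check, sketched below under stated assumptions: the function name and the messages are hypothetical, and the check only flags blocks that are not exactly ten characters from the A, T, C, G alphabet; it does not attempt to correct them.

```python
import re
from typing import List

# A well-formed block is exactly ten unambiguous bases.
VALID_BLOCK = re.compile(r"^[ACGT]{10}$")


def check_row(line: str, expected_end: int) -> List[str]:
    """Return a list of problems found in one row of the listing.

    `expected_end` is the running count the row should end with
    (60 for the first row, 120 for the second, and so on).
    """
    tokens = line.split()
    if not tokens or not tokens[-1].isdigit():
        return ["missing running count"]
    *blocks, count = tokens
    problems = []
    if int(count) != expected_end:
        problems.append(f"count {count} != expected {expected_end}")
    if len(blocks) != 6:
        problems.append(f"{len(blocks)} blocks instead of 6")
    for index, block in enumerate(blocks, start=1):
        if not VALID_BLOCK.match(block):
            problems.append(f"block {index} suspicious: {block!r}")
    return problems


if __name__ == "__main__":
    rows = [
        ("GAATTCAGTA TCGGGCAAAT CTTCATGGTG ATGTGAATCT AGCGAGATGA ATC3CAGGAGA 60", 60),
        ("AAGCCCCACG ACGAACGTTC TTATTGGCTT AACCGCTACT AGTATTTTTA TATAGTAGTT 180", 180),
    ]
    for text, end in rows:
        print(end, check_row(text, end))
```

Rows that pass return an empty list; rows containing a damaged block are reported so that they can be compared against the original publication rather than silently repaired.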
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Sandoz Ltd
(B) STREET: Lichtstrasse 35
(C) CITY: Basel
(E) COUNTRY: Switzerland
(F) POSTAL CODE (ZIP): CH-4002
(G) TELEPHONE: 41-61-324 4395
(H) TELEFAX: 41-61-322 7532

(A) NAME: Sandoz-Patent-GmbH
(B) STREET: Humboldstr. 3
(C) CITY: Loerrach
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): D-7850

(A) NAME: Sandoz-Erfindungen Verwaltungsgesellschaft mbH
(B) STREET: Brunnerstr 59
(C) CITY: Vienna
(E) COUNTRY: Austria
(F) POSTAL CODE (ZIP): A-1235

(ii) TITLE OF INVENTION: Cyclosporin Synthetase
(iii) NUMBER OF SEQUENCES: 7

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 46899 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Tolypocladium niveum
(B) STRAIN: ATCC 34921

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GAATTCAGTA TCGGGCAAAT CTTCATGGTG ATGTGAATCT AGCGAGATGA ATC3CAGGAGA 60 

ATCGGCTGGG ATGGCCTCCA GATATACACC CTTCTAGCAT CACAAATCCC GCCGATGTAC 120

AAGCCCCACG ACGAACGTTC TTATTGGCTT AACCGCTACT AGTATTTTTA TATAGTAGTT 180 

TATATGCGTA CSGTACTCTCT TCTGTTAATG TCAGAGGATC TATTGCGATG GGCAGGCTCSC 240 






AGCAATGCCT CGATCTTGAT GGAGGGATAG TTGTTTGCTG ATGAGTATAG GTACTTATTC 300

TATTAGTAAC TCTATGCTTG TTTTAAGGTA CCGATACTCG TACGTCGATC G?GGGGGGTG 360

TAAGCCACGT GGTCCACAGT CTGACGAAGT TTCGAACCCT TCAGGGATTA TTAACAAGGT 420

AATACGGAGT AAAGGAGTAG TATCATAGCT TGGAATATGT GGAAACCCCG AGGAGGCAAT 480

CCCCTTGGCT GTCAGATTAC CTTACAAGTC TCCATCTACT GACCACGAAC TGAACTCAGT 540

TCCTTCAGTC GCTTACTATT TACTGGAACA TCTCCTCGAA TTTGGAAAAA GAAAAAAGCA 600

CCAACAAAAA CTCAGGAGAT CCACTCTTTA TCGGACACAA ATAGCTACTT GCTTTCTGTG 660

CCGTGCAACG ATACTGTCGG AAAGCTCGAC CTACGAGCCA CTTACACCTG TGGTAGCAGC 720

ACAAAGCCGG ACTCGCCACA ACTCAGCAAC TAGCCATTCG AAATCGCAAA CTACAGCAGC 780

TACACGAACT TCATGAGATG GATTGTACAT ACTGACTACA CTAGGTTTAC TAACAGATAG 840

ACAACCATTG CCAGATTATA GAGCCTTTTG CTTTCTTGGT CAACATGGGC GCCATCGGGC 900

AAGACATGGC ATATGATCGC CTTGCCAACC CGTCTCGGGC GAGTTCCATC TCTTCGAACC 960

GATACTCCGA ACCTGTCGAG CAATCCTTTG CCCAGGGCAG ACTGTGGTTC CTGCACCAGC 1020

TGAAGCTCGG TGCGAGCTGG GACATTACGC CGGCCGCGAT CCGACTTCGG GGCCATCTCG 1080

ACATCGATGC GCTGAACGCT GCCTCGCGCG CTCTGACGCA GCGCCACGAG ACGCTCCGAA 1140

CGACGTTCAA GGAGCAGGAT GGCGTGGGCG TACAGGTTGT GCACGCCTCG GGCCTCGAAA 1200

GAGGGTTGAG GATTGTTGAT GCCTCGAGCC GCGATTTGGC CCAGCTCCTG GCAGAGGAAC 1260

AAACCATGAA GTTCGACCTA GAGTCTGAGC CAGCTTGGAG AGTTGCATTG TTGAAGGTGG 1320

CCGAGGATCA CCATATTCTT TCCATTGTTG TACACCATAT CATCTCAGAC AGCCGGTCTC 1380

TCGACATTAT TCAGCAGGAG CTTGGAGAAC TCTACACGGC CGCCTCGCAG GGGAAATCGA 1440

TTTCGGCTTG TCCCTTGGGT CCAATTCCCA TTCAATACCG TGACTTGACG ACTTGGCAGA 1500

ACCAGGACGA GCAGGTCGCT GAGCAGGAAA GGCAGCTCGG ATACTGGATC GAGCAGCTCG 1560

ATAACAACAC ACCGGCCGAG CTCCTCACAG AGCTTCCCCG GCCAGCTATC CCATCTGGCG 1620

AAACTGGCAA GATCTCCTTC CAGATCGATG GATCGGTACA CAAAGAACTC CTGGCCTTCT 1680

GCCGCTCCCA GCAAGTAACC GCCTACGCCG TGCTGCTGGC AGCGTTTCGC GTGGCGCACT 1740

TTCGCCTCAC TGGAGCCGAG GATGCAACCA TCGGAGCGCC CGTTGCCAAC CGCGACCGGC 1800

CGGAGCTGGA GAACATGGTG GCTCCCTTGG CCACTCTGCA GTGCATGCGA GTCGTGCTCG 1860

ACGAGGACGA CACCTTCGAG TCGGTGCTGC GGCAGATCAT GTCCGTCATG ACAGAGGCAC 1920

ATGCCAACCG CGACGTCCCC TTTGAGCGCA TCGTGTCTGC GTTGCTGCCC GGGTCGACAG 1980

ACACATCACG ACACCCGCTT GTGCAGCTCA TGTTTGCTTT GCATCCCQCG CAGGATACGG 2040

GCCGAGCCCG GTGGGGGTTC CTCGAGGCTG AGACTCTGCA GAGTGCGGCC CCGACACGAT 2100

TCGACATGGA GATGCACCTG TTTGAGGGAG ACGACCGGTT CGATGCAAAC GTGCTGTTCT 2160

CCACGGGCCT TTTCGACGCA GAGGCCATCC GCAGCGTGGT TTCTATCTTT CGGGAAGTCC 2220

TGCGCCGTGG CATCTCGGAG CCTGCGGTGC ATGTGAAGAC GATGCCGCTC ACCGATGGGC 2280
TCGCCGCGAT CCGGGACATG GGCTTGCTGG ATATCGGGAC CACCGACTAC CCCCGCGAGG 2340

CGAGCGTGGT TGATATGTTC CAAGAGCAGG TGGCCTTGAA TCCAAGCGCC ACCGCCGTGG 2400

CCGATGCTTC GTCCAGATTG AGCTACTCTG AGTTGGATCA CAAGTCAGAT CAGCTGGCCG 2460

[illegible] CAGACGGCAG CTCAAGCCCG AGACCTTGAT TGGCGT3TTG TCTCCTCCGT 2520

[illegible] CATGGT'^'TCC TTCCTCGGTA TCCTCAAGGC TCATCTGGCT TATCTGCCTC 2580

[illegible] fGTTPPPTTG GCACGCATCG AATCAATCCT TTCGGCCGTG GACGGGCACA 2640

[illegible] fJPTTGGGAGC AACGTGCCCC AACCCAAGGT GGATGTACCC GATGTTGAGT 2700

[illegible] [illegible] CTGAACGGGT C"CAGGTGAA TGGGCTTGCA GGGAAACAGG 2760

[illegible] [illegible] ACGGACCTGG CCTACGTCAT CTTCACCTCG GGATCGACTG 2820

[illegible] [illegible] AT^PGAGPATP GGGGCATCGT ACGCCTCG'i'G AAAGGAACAA 2880

[illegible] [illegible] GPAGPAGTGP CGACAGCTCA CCTGGCCAAC ATCGCTTTCG 2940

[illegible] ATGGGAGATP TATACCCCTA TCCTTAA'T'GG CGGCACTCTT GTCTGTATCG 3000

[illegible] PAPGPTAGAT AGCAAGGCAC TAGAAGC^^GT ATTCACCAAG GAGGGCATTC 3060

[illegible] PPTTGPTPPT GCTCTGATCA AGCAGTGTCT CGCCGACAGA CCGGCGATCT 3120

TTCrCGGCCT GGATAGCCTG TACGCTATTG GCGATCGCTT CGACCGACGT GACGCCCTCC 3180

[illegible] CTTGGTGAAG CATGGCGTTT ATAATGCCTA TGGTCCAACC GAGAATTCCG 3240

[illegible] CATCTACAGC GTCTCCGAGG CTTCACCGTT TGTCACGGGG GTGCCCGTTG 3300

GCCGGGCCAT CAGCAACTCG GGCGCCTATG TAATGGATCA GGATCAGCAA TTGGTCTCTC 3360

P'^GGGGTGAT GGGGGA.GCTT GTGGTTTCTG GAGATGGCCT AGCTCGAGGA TATACCGATT 3420

[illegible] TAAGAACCGA TTTGTCGTGG TGCAGATTGA CGGCGAGTCA ATCCGGGGCT 3480

ATCGTACGGG AGACCGGGCC CGATACAGCC TCAAGGGTGG CCAGATTGAG TTCTTTGGCC 3540

GCATGGATCA GCAGGTCAAG ATCCGTGGCC ATCGTATCGA GCCAGCCGAG GTAGAGCACG 3600

CTTTACTCAA CAGCGACCAA GTACGCGATG CAGCAGTGGT TATCCGGAGA CAGGAGGAGG 3660

AAGAGCCTGC GATGATTGCC TTCGTTACGA CGCAGGGTAC GCTCCCTGAT CACCTCGTCA 3720

ACATCAACGG CAACGGCCAC GTTCCCGACG GCAACGGCAG CAAGAACGAC CAATTCGCCG 3780

TTCACGTCGA GAGCGAACTG CGCCGGCGCT TGCAGATGTT GCTGCCCTCC TACATGATGC 3840

CGGCCCGCAT CGTGGTGCTT GACCATCTCC CTCTCAACCC CAACGGCAAA GTCGACCGGA 3900

AGGCGCTGGG TCAGTCGGCC AAGACTGTGC AGAAGAGCAA GCTGGTCTCA CAGCGCGTCG 3960

CCCCACGCAA TGAGATCGAG GCCGTGCTTT GCGAGGAGTA CAGGAGTGTG CTTGGTGTCG 4020

AGGTTGGCAT CACCGATAAC TTCTTCGACC TGGGTGGTCA TTCCTTGACG GCCATGAAGC 4080

TCGCGGCACG GATCAGCCAG AGGCTCGACA TTCAAGCATC CGTAGCAACT GTCTTTGAGC 4140

AGCCGATGCT CGCTGACCTC GCCGCCACGA TCCAGCGCGG CTCGACTCTG TATAGCGTCA 4200

TCCCTACGAC AGAATACACG GGACCGGTGG AGCAATCATT TGCCCAAGGC CGTCTGTGGT 4260
TCCTTGAGCA GCTGAATACC GGCGCCTCAT GGTATAATGT GATGCTCACC GTACGACTAC 4320

GAGGCCACCT CGACGTGGAT GCGCTGGGAA CGGCCCTGCT CGCCCTGGAG AAACGGCACG 4380

AGACTCTTCG GACAACCTTT GAGGAACGGG ACGGGGTTGG CATGCAGGTA GTCCACAGCA 4440

GCCTCATGGG GGAGCTGCGG CTGATTGATA TA7CAGAGAA ATCTGGCACT GCCGCGCATG 4500

AGGCACTGAT GAAGGAGCAG TCAACCCGCT TCGACCTGAG TCGCGAGCCA GGTTGGAGAG 4560

TGGCGCTGCT GAAGTTGGCA GACCACCACA TCTTCTCGAT CGTCATGCAC CACATTGTAT 4620

CGGATGGATG GTCTCTCGAC CTCCTACGAC ACGAGCTGGG CCAACTCTAC TCGGCAGCTC 4680

TGCGCGGCCA GGACCCATTG TCGCGCCTTG AGCCACTCCC GATCCAATAC CGCGACTTTG 4740

CGGTCTGGCA GAAGCAAGAC AGCCAGCAGA AAGCAGCGCA CCAGAGGCAA TTGGAGTACT 4800

GGACCAAGCA GCTTGCAGAC AGCACGCCTG CAGAGCTCTT GACAGACTTC CCGCGGCCCT 4860

CGATTCTATC CGGAAAGGCT GGAAAGGTCC CCGTTGCCAT CGAGGGG'::CT CTATACGACA 4920

CGCTTCAAGT CTTCA3CCGC ACCCATCAAG TCACGTCGTT TGCTGTCCTA CTCGCAGCCT 4980

TCCGTGCAGC ACATTTCCGG CTTACGGGAT CTGATAATGC GACTATTGGT GTCCCCAGCG 5040

CGAACCGGAA TCGACCTGAG CTTGAGAACG TGATCGGCTT CTTCGTGAAC ACACAATGTA 5100

TACGTATCAC GATCGATGAA AACGATAACT TTGAATCGTT GGTCCGGCAG GTCCGGTCGA 5160

CGACTACAGC CGCACAGGAC AATCAGGATG TCCCGTTCGA ACAGGTCGTT TCCAGCCTCA 5220

TGCCGAGCAG CTCGAGAGAT GCATCCCGGA ACCCTCTGGT GCAGCTCATG TTTGCACTGC 5280

ACGGCCAGCA GGATCTGTTC AAGATCCAAC TGGAAGGGAC CGAAGAGGAG GTGATCCCAA 5340

CAGAAGAAGT GACGAGGTTC GACATCGAGT TCCATCTCTA CCAAGGCGCC AGCAAGCTGA 5400

GCGGTGATAT CATATTCGCT GCCGACTTAT TCGAAGCCGA AACTATTCGT GGCGTCGTCA 5460

GCGTCTTTCA GGAGGTTCTG AGGCGCGGAT TGCAACAGCC GCAGACCCCG ATCATGACAA 5520

TGCCACTCAC CGACGGCATT CCAGAGTTGG AGAGGATGGG CTTGTTGCAC ATGGTCAAGA 5580

CCGACTACCC CCGCAACATG TCTGTGGTAG ACGTATTCCA ACAACAAGTT CGTCTCAGCG 5640

CCGAGGCTAC AGCTGTTATC GACTCATCTT CGCGGATGAG TTACGCCGAA CTGGACCAGA 5700

GGTCCGATCA GGTGGCAGCG TGGCTTCGCC AGCGACAACT GCCAGCCGAA ACCTTTGTGG 5760

CAGTGCTCGC ACCACGCTCG TGCGAGGCCG TCATTGCTCT CTTCGGCATC TTGAAGGCTG 5820

GTCATGCCTA CCTACCGCTC GACGTCAATG TGCCAGCAGC GCGTCTTCGC GCCATCTTGG 5880

CCGAGGTGAA GGGCGAGAAG CTGGTTCTCC TAGGAGCAGG TGAGCCATCA CCGGAAGGCC 5940

AGTCGCCAGA GGTCTCGATC GTGAGGATTG CCGATGCCAC GAGCCCTGCT GGCCATGCCA 6000

GCTTGCGTGA TGGCAAGTCC AAGCCAACCG CAGGCAGCCT CGCCTATGTC ATCTTCACTT 6060

CCGGATCCAC TGGTAAACCC AAGGGTGTGA TGATCGAGCA CCGCGGAGTC TTGCGCCTTG 6120

TGAAGCAGAC CAACATTCTA TCCAGTCTAC CGCCGGCGCA GACCTTCCGA ATGGCTCACA 6180

TGTCCAACCT TGCGTTCGAT GCATCGATAT GGGAGGTCTT CACGGCCCTT CTCAACGGAG 6240

GCTCTCTTGT ATGCATTGAC AGGTTTACCA TCTTGGATGC TCAAGCGTTG GAGGCACTAT 6300
TCCTCAGGGA GCACATCAAT ATTGCACTGT TCCCACCCGC CCTGTTGAAG CAATGCCTCA 6360

CGGATGCAGC TGCTACCATC AAGTCTCTTG ACCTCCTATA CGTAGGAGGA GACCGGTTAG 6420

ACACAGCGGA CGCAGCTCTG GCCAAAGCTC TGGTCAAGTC AGAGGTCTAC AATGCCTACG 6480

GCCCAACGGA AAATACGGTC ATGAGCACTT TATACTCGAT TGCTGACACA GAACGATTTG 6540

TTAATGGTGT GCCAATTGGA AGAGCCGTTA GCAACTCTGG GGTCTACGTG ATGGACCAGA 6600

ATCAGCAGCT TGTGCCGTTG GGCGTGATGG GAGAGCTGGT AGTCACTGGA GATGGTXTGG 6660

CTCGTGGCTA CACCAACCCG GCTCTTGATT CCGACCGGTT CGTGGATGTC ATTGCTCGAG 6720

GCCAACTTCT CAGGGCCTAT CGCACAGGCG ACCGAGCTCG TTACCGGCCC AAGGATGGCC 6780

AGGTTGAGTT CTTTGGTCGG ATCGATCACC AGGTCAAGGT CCGAGGGCAC CGCATCGAGC 6840

TCGCCGAAGT AGAACACGCT TTGTTAAGCA GTGCCGGTGT GCACGATGCC GTTGTCGT7T 6900

CAAACTCGCA GGAAGACAAT CAGGGAGTCG AGATGGTGGC CTTCATCACC GCCCAAGACA 6960

ACGAGACTCT CCAGGAAGCA CAGTCGAGCA ACCAAGTCCA GGAATGGGAG AGCCATTTCG 7020

AGACCACGGC CTACGCGGAC ATCACGGCCA TTGATCAAAA CACGCTCGGC CGAGACTTTA 7080

CATCCTGGAC CTCTATGTAC GATGGAACGC TTATTGACAA GAGGGAGATG CAGGAATGGC 7140

TCGACGATAC TATGCGCACT TTCCTTGACG GTCAAGCAGC TGGCCACGTG CTTGAAATCG 7200

GTACCGGCAC CGGTATGGTT CTATTCAATC TCGGTCAAGC TGGGCTGAAG AGCTACATTG 7260

GACTGGAACC TTCCCAATCC GCGGTTCAAT TCGTCAACAA GGCAGCCCAA ACGTTCCCAG 7320

GGCTTGAGGG AAAGGCCCAA GTACATGTCG GCACGGCGAT GGATAC3GGC CGGCTCAGCG 7380

CTTTGAGCCC GGATCTGATC GTCATCAACT CCGTGGCCCA GTATTTCCCG AGCCGAGAAT 7440

ACCTCGCCGA GGTGGTTGAG GCCCTGGTCC GGATTCCAGG CGTTCGCCGT ATCTTCTTCG 7500

GAGACATGAG AACCTATGCC ACCCACAAAG ACTTCCTTGT TGCACGGGCG GTCCACACAA 7560

ACGGGAGCAA GGTGACGAGA TCTAAAGTGC AACAGGAGGT GGCCCGGTTA GAGGAACTGG 7620

AGGAGGAATT GCTTGTCGAC CCTGCCTTCT TCACAAGTCT CAAGGAATCT CTATCGGAAG 7680

AAATAGAGCA TGTTGAGATC CTGCCGAAGA ACATGAAGGT GAACAACGAG CTCAGCTCAT 7740

ACCGGTACGG CGCGGTTCTG CACATCCGTA ACCACAACCA GAATCAAAGC AGGTCGATTC 7800

ACAAGATCAA TGCAGAGTCC TGGATCGACT TCGCCTCAAG CCAGATGGAT AGACAGGGTC 7860

TTGCTAGGCT GTTGAAAGAG AACAAAGATG CCGAAAGTAT CGCTGTGTTC AACATCCCTT 7920

ACAGCAAGAC TATCGTGGAA CGGCACATCG CCAAGTCTTT GGCCGATGAC CACGACGGCG 7980

ATGATACACA TAGCTCAATC GATGGAGTCG CCTGGATCTC AGCCGCGCGC GAGAAGGCGA 8040

GCCAGTGTCC ATCTCTTGAT GTGCATGACC TCGTGCAGTT GGCCGAGGAC GCTGGGTTCC 8100

GCGTCGAGGT CAGCTGGGCC CGCCAAAGGT CCCAGAACGG CGCTCTCGAT GTTTTCTTCC 8160

ATCACTTCCA GCCTACCGAG AACGAAAGCC GCGCGCTCGT CGATTTCCCC ACCGACTACA 8220

AGGGCCAACA AGCCAGAAGC CTGACGAACC GGCCCCTGCA GCGGGTTGAG AGCCGTCGAA 8280
TCGAAGCACA GGTCCGAGAG CAGCTCCAAG TATTGCTCCC GGCATACATG ATCCCAGCCC 8340

GGATTGTGGT TCTCCAGAAC ATGCC3CTGA ACACGAGCGG CAAGGTAGAT CGCAAGGAGC 8400

TCACCCTTCG AGCCAAGGTC ACCGCCGCAC GTACGCCGAG CTCCGAACTC GTGGCTCCTC 8460

GTGATTCTAT TGAAGCCATC ATCTGCAAGG AATTCAAGGA TGTTCTCGGC GTCGAAGTGG 8520

GTATTACAGA CAACTTCTTT AATGTCGGAG GACACTCTCT TTTGGCCACG AAGCTCGCAG 8580

CACGCCTGAG CCGACAACTC AATGCCCAGA TCGCAGTCAA AGACATCTTC GACCGGCCAG 8640

TTATCGCCGA TCTGGCAGCC ACAATCCAGC AGGATACCAC GGAGCACAAC CCTATCCTAC 8700

CGACTTCTTA TACGGGTCCA GTCGAACAAT CGTTCGCCCA AGGCCGACTC TGGTTCCTCG 8760

ATCAACTGAA TGTCGGCGCC ACATG3TATC TCATGCCCTT CGCAGTCCGG CTGCGAGGGC 8820

CTTTGGTTGT TTCTGCTCTC GCTGCCGCTC TTCTGGCCCT AGAGGAGCGC CACGAGACAC 8880

TGCGAACAAC CTTTATCGAA CAGGAAGGCA TCGGCATGCA GGTCATCCAT CCGTTTGCCC 8940

CTAAGGAACT GAGGGTGATC GATGTCTCGG GCGAGGAAGA GAGCACTATC CAGAAGATAC 9000

TGGAAAAGGA ACAGACAACA CCCTTCAATC TCGCTTCCGA GCCCGGTTTC AGACTAGCAT 9060

TACTGAAGAC AGGAGAGGAC GAACACATTC TCTCGACAGT AATGCACCAT GCAATCTCTG 9120

ATGGCTGGTC TGTCGATATC TTCCAACAAG AAATCGGCCA ATTCTACTCG GCAATCCTCC 9180

GCGGACACGA TCCTTTGGCC CAGATCGCAC CGCTCTCGAT CCAGTATCGC GATTTCGCGA 9240

CTTGGCAGAG GCAGATATTC CAAGTCGCAG AGCACCGGCG GCAGCTTGCA TACTGGACTA 9300

AACAGCTTGC CGATAATAAA CCAGCCGAGC TGCTGACCGA TTTCAAGCGA CCGCCTATGC 9360

TCTCCGGCCG CGCGGGCGAG ATCCCGGTGG TCGTCGACGG CTTGATC7AT GAGAAGCTTC 9420

AGGACTTCTG TCGAATCCGC CAGGTGACCG CCTTTACCGT GTTGCTGGCT GCTTTCCGCG 9480

CAGCGCACTA TCGTATGACC GGGACTGAGG ATGCGACGAT TGGAACACCT ATCGCGAACC 9540

GTAACCGGCC GGAGCTTGAG GGCTTGATCG GCTTCTTCGT CAACACACAG TGCATGCGTA 9600

TCACCGTCGA TGTAGAGGAT TCGTTCGAAA CGTTGGTTCA CCAGGTTCGA GAAACGACGC 9660

TGGCTGCACA TGCCAACCAG GATGTTCCTT TCGAACAGAT TGTCTCAAAC ATCTTGCCCG 9720

GATCGAGCGA CACTTCTCGG AATCCGCTGG TACAGCTCAT GTTTGCTCTA CATTCGCAGC 9780

AGAACCTTGG CAAGGTCCGC CTCGAGGGTA TCGAGGAGGA GATCATCTCC ATTGCTGAGA 9840

CCACGAGATT TGATATCGAG TTCCATCTGT ACCAAGAGGC TCAGAGGCTG AACGGTAGTA 9900

TCGTCTATGC AGCTGATCTC TTCGTGCCCG AGACTATACA GAGGGTGATC ACCATCTTCC 9960

AAGGCATCCT ACAGAAAGGC CTCGGCGAGC CGGATATGCC CGTCGCCTCT ATGGCGCTTG 10020

ATGGTGGGCT GGAGTCCCTC CGAAGCACAG GACTGCTGCA CCCTCAACAA ACTGATTATC 10080

CGTGCGATGC TTCAGTGGTG CAGATCTTCA AACAGCAGGT GGCAGTCAAC CCGGATGTCA 10140

TCGCGGTGAG AGATGAATCA ACACGGCTGA GCTATGCCGA CTTGGATCGG AAGTCGGATC 10200

AAGTGGCTTG CTGGCTATCT CGGCGAGGTA TCGCTCCTGA GACGTTCGTG GCGATCCTGG 10260

CACCACGCTC GTGCGAGACA ATCGTGGCCA TCCTCGGTGT GTTGAAGGCC AACCTTGCAT 10320






EP 0 578 616 A2 






ATCTGCCTCT TGATGTCAAT GTTCCTGCGT CCCGGCTCGA GGCCATCCTT TCGGAGGTGT 10380

CGGGATCGAT GTTGGTCCTT GTGGGCGCAG AGACCCCGAT TCCGGAGGGG ATGGCTGAAG 10440

CGGAGACGAT CCGGATCACG GAGATTCTCG CCGACGCAAA GACCGACGAC ATCAACGGGC 10500

TGGCCGCGAG TCAGCCCACT GCAGCAAGCC TTGCGTATGT GATCTTTACG TCTGGATCGA 10560

CTGGTCGACC AAAGGGCGTC ATGGTCGAGC ATCGCGGAAT CGTTCGTCTT ACAAAGCAGA 10620

CCAACATCAC ATCCAAGCTG CCAGAGTCTT TCCACATGGC CCACATATCG AATCTTGCCT 10680

TCGATGCCTC CGTGTGGGAA GTGTTCACGA CGCTTCTCAA TGGAG3CACG TTGGTGTGTA 10740

TCGACTATTT CACTCTCTTG GAGAGCACAG CGCTCGAGAA GGTCTTCTTC GACCAACGCG 10800

TCAATGTTGC TCTGCTCCCT CCAGCCTTGC 7GAAACAGTG CCTTGACAAC TCACCCGCTC 10860

TGGTCAAAAC TCTCAGCGTT CTCTATATTG GTGGTGATAG GCTAGATGCT TCTGATGCTG 10920

CCAAAGCAAG GGGGCTCGTC CAGACGCAAG CTTTCAATGC GTACGGCCCA ACGGAAAACA 10980

CAGTCATGAG CACAATCTAT CCCATTGCCG AAGACCCCTT CATCAATGGT GTGCCCATCG 11040

GTCATGCTGT CAGTAACTCG GGAGCTTTTG TCATGGACCA GAATCAGCAA ATCACCCCCC 11100

CTGGTGCAAT GGGAGAACTC ATCGTGACTG GAGACGGTCT TGCTCGAGGC TACACTACTT 11160

CCTCTCTCAA CACTGGTCGA TTTATCAACG TTGATATCGA TGGCGAGCAA GTCAGGGCAT 11220

ACCGCACAGG AGATCGAGTG CGCTACCGAC CAAAAGACCT CCAGATCGAA TTCTTCGGCC 11280

GTATCGATCA CCAGGTCAAG ATCCGCGGCC ACCGCATCGA ACCAGCTGAG GTCGAGTATG 11340

CTCTTCTAAG CCACGACCTG GTCACTGATG CGGCAGTCGT CACCCACTCT CAAGAAAATC 11400

AAGACCTGGA GATGGTTGGA TTCGTGGCCG CCCGAGTCGC TGATGTTAGA GAGGA7GAGT 11460

CCAGCAACCA GGTCCAAGAA TGGCAGACTC ACTTCGACAG CATCGCATAC GCAGATATCA 11520

CCACAATCGA TCAGCAAAGC CTTGGACGGG ACTTCATGTC ATGGACCTCC ATGTACGATG 11580

GCAGCCTGAT CAAGAAGAGC CAGATGCAGG AGTGGCTCGA TGACACCATG CGGTCACTCC 11640

TGGATTCCCA GCCCCCTGGT CACGTACTCG AAGTTGGTAC AGGGACTGGC ATGGTTCTGT 11700

TCAACCTCGG CAGAGAAGGG GGTCTGCAAA GCTACGTTGG CCTAGAGCCA TCGCCATCCG 11760

CAACCGCGTT TGTCAACAAG GCCGCCAAGT CATTCCCTGG GCTTGAGGAT AGGATCCGGG 11820

TTGAAGTTGG AACAGCAACT GATATCGACC GGCTTGGAGA CGATCTGCAC [illegible] 11880

TCGTCGTCAA CTCGGTCGCT CAATACTTCC CGAGTCAAGA CTATCTCGCC CAGTTGGTCA 11940

GAGATCTTAC CAAGGTCCCT GGCGTGGAGC GTATCTTCTT TGGTGATATG AGGTCGCACG 12000

CCATCAACAG GGATTTCCTT GTCGCTCGCG CAGTTCATGC ACTGGGCGAT AAGGCAACAA 12060

AGGCCGAGAT TCAACGGGAG GTTGTTCGAA TGGAAGAGTC TGAAGACGAA CTGCTCGTTG 12120

ATCCGGCCTT TTTCACCTCC CTGACGACGC AAGTAGAGAA TATCAAGCAC GTGGAGATTC 12180

TCCCCAAGAG AATGCGAGCC ACGAACGAGC TGAGCTCGTA TCGGTATGCT GCTGTTCTGC 12240

ACGTCAATGA TCTGGCGAAA CCGGCACACA AAGTCAGTCC TGGCGCCTGG GTTGATTTTG 12300
CCGCGACGAA GATGGATCGC GATGCCCTGA TCCGTCTGCT CAGGGGCACC AAAATTTCCG 12360

ACCACATTGC AATCGCCAAT ATTCCCAACA GCAAGACAAT CGTCGAGCGA ACCATCTGCG 12420

AATCGGTTTA CGACCTTGGC GGAGACGCCA AAGACTCGAA CGACAGAGTC TCATGGCTTT 12480

CGGCTGCTCG ATCGAATGCC GTGAAAGTGG CTTCCCTCTC CGCGATCGAT CTCGTTGATA 12540

TTGCACAAGA GGCAGGCTTC CGGGTCGAGA TCAGCTGCGC GCGGCAGTGG TCTCAGAATG 12600

GCGCGTTGGA CGCCGTATTC CACCACCTTG GCCCATCACC ACAGTCGTCT CATGTGTTGA 12660

TTGACTTCTT GACC3ACCAC CAAGGTCGAC CAGAAGAAGC CCTGACGAAC CACCCGCTGC 12720

ACCGAGCACA GTCTCGACGC GTCGAGAGGC AGATCCGCGA GAGACTCCAG ACTCTCCTGC 12780

CGGCCTACAT GATCCCGGCC CAGATCATGG TTCTTGACAA GCTACCTCTC AACGCGAATG 12840

GAAAGGTCGA CCGGAAGCAG TTGACGCAAC GGGCCCAGAC GGTACCCAAG GCCAAGCAAG 12900

TGTCTGCTCC TdGGCCCCG CGCACAGAGA TCGAAAGGGT GCTCTGCCAG GAGTTCTCCG 12960

ACGTCCTAGG GGTTGATATC GGGATAATGG AAAACTTCTT CGATCTCGGT GGCCACTCGC 13020

TCATGGCAAC AAAGCTAGCC GCACGCATCA GCCGCCGACT AGAGACTCAC GTCTCCGTCA 13080

AGGAGATCTT TGACCATCCG CGAGTCTGCG ATCTTGTTCT CATAGTACAG CAGGGATCAG 13140

CGCCTCATGA CCCCATCGTT TCGACCAAAT ACACCGGGCC AGTGCCTCAG TCGTTTGCCC 13200

AGGGTCGTCT TTGGTTCCTC GACCAGCTCA ACTTTGGCGC AACATGGTAT CTCATGCCCC 13260

TTGCCGTCCG TCTTCGCGGT GCCATGAACG TTCATGCTCT TACCGCGGCC TTGTTGGCCC 13320

TCGAGAGGCG TCACGAGCTC CTCCGCACCA CGTTCTACGA ACAAAACGGC GTCGGTATGC 13380

AAAAGGTCAA TCCAGTTGTC ACCGAGACCC TGAGGATCAT TGATCTCTCC AACGGCGACG 13440

GCGACTATCT CCCGACATTG AAGAAGGAGC AAACTGCTCC GTTCCACCTG GAAACCGAGC 13500

CCGGATGGCG CGTGGCTCTA CTGCGCCTCG GGCCAGGCGA CTACATCTTA TCTGTCGTCA 13560

TGCATCACAT CATTTCCGAC GGCTGGTCTG TGGATGTTCT CTTCCAAGAG CTGGGCCAGT 13620

TCTATTCCAC GGCTGTCAAA GGCCACGATC CCCTATCGCA GACCACACCC CTCCCGATCC 13680

ATTATCGCGA TTTTGCTCTG TGGCAGAAGA AGCCAACCCA AGAAAGCGAA CACGAGCGTC 13740

AGCTGCAATA CTGGGTCGAG CAACTTGTAG ATAGTGCCCC GGCCGAGCTA CTCACGGATC 13800

TGCCGCGGCC TTCGATCCTC TCTGGTCAGG CTGGGGAGAT GTCGGTCACG ATCGAGGGAG 13860

CACTATACAA GAACTTGGAG GAATTCTGCC GGGTCCATCG CGTTACCTCC TTCGTGGTAC 13920

TGCTTGCGGC CCTACGCGCA GCCCATTATC GCCTCACAGG TTCCGAAGAC GCAACTATAG 13980

GGACACCAAT CGCCAATCGT AACCGACCTG AACTTGAGCA GATAATCGGC TTCTTCGTCA 14040

ATACGCAATG TATACGCATT ACCGTCAACG AGGACGAGAC CTTTGAGTCA CTAGTGCAGC 14100

AGGTCCGGTC AACGGCGACA GCTGCATTCG CCCATCAGGA CGTCCCGTTC GAGAAGATCG 14160

TCTCTACTCT TTTGCCCGGT TCTCGAGATG CATCCCGAAA CCCACTTGTG CAGCTCATGT 14220

TTGCGGTGCA TTCGCAGAAG AACCTCGGTG AGCTGAAGCT GGAAAACGCT CACAGCGAGG 14280

TTGTTCCCAC GGAGATCACG ACCCGGTTCG ATTTGGAATT CCACCTGTTC CAGCAAGATG 14340
ACAAGCTTGA GGGCTCCATC CTCTATTCAA CCGATCTCTT CGAAGCAGTC TCGGTCCAAA 14400

GTCTTCTTTC AGTATTCCAG GAAATTCTGC GCCGGGGTTT GAACGGTCCG GACGTGCCCA 14460

TCAGCACCCT ACCACTTCAG GATGGAATCG TCGACCTACA AAGACAGGGC CTGTTGGATG 14520 

TCCAGAAGAC GGAATATCCT CGTGATTCCT CTGTGGTTGA TGTGTTCCAT GAGCAGGTCT 14580

CGATCAACCC CGATTCCATT GCACTGATAC ATGGCTCGGA GAAGCTCAGC TACGCCCAGC 14640

TCGACAGGGA ATCTGACAGG GTCGCTCGCT GGCTCCGTCA CCGTTCTTTC AGCTCCGACA 14700 

CGTTAATGGC GGTGTTGGCG CCACGGTCTT GCGAGACGAT CATCGCGTTC CTCGGAATCC 14760

TCAAGGCAAA CCTTGCGTAC CTACCTCTGG ATGTCAAGGC TCCTGCTGCC CGCATTGATG 14820 

CCATTGTATC GTCGCTACCC GGGAACAAGC TCATCCTGCT GGGCGCAAAC GTTACGCCGC 14880 

CCAAGCTTCA GGAAGCGGCC ATCGATTTCG TGCCCATCCG TGATACCTTC ACTACACTCA 14940

CTGACGGCAC ACTTCAAGAT GGGCCTACCA TCGAGCGACC CTCTGCGCAA AGCCTAGCGT 15000 

ACGCCATGTT CACGTCTGGT TCTACCGGAC GACCGAAGGG TGTTATGGTC CAGCACCGCA 15060 

ACATCGTCCG CTTGGTGAAG AACAGTAACG TCGTC3CTAA GCAGCCCGCA GCAGCTCGCA 15120 

TAGCACATAT ATCCAATTTG GCGTTTGACG CCTCGTCTTG GGAGATCTAT GCCCCGCTGC 15180 

TCAACGGCGG CGCAATTGTG TGTGCCGACT ACTTCACAAC GATTGATCCA CAGGCTCTTC 15240 

AAGAAACCTT CCAGGAACAC GAGATCCGCG GTGCTATGCT GCCGCCCTCG CTCCTCAAGC 15300

AGTGCCTGGT TCAGGCCCCA GACATGATCA GCAGGCTTGA CATCTTATTT GCTGCTGGTG 15360 

ATCGCTTCAG TAGCGTGGAT GCTCTCCAGG CCCAACGTCT CGTTGGCTCG GGCGTCTTCA 15420 

ATGCGTATGG CCCTACGGAG AATAC3ATTC TGAGCACTAT CTATAACGTT GCTGAAAACG 15480 

ACTCCTTCGT TAACGGCGTT CCCATAGGCA GTGCTGTGAG CAACTCCGGA GCCTACATCA 15540 

TGGATAAGAA CCAGCAGCTC GTGCCAGCTG GAGTTATGGG AGAACTGGTT GTTACTGGTG 15600 

ACGGTCTCGC CCGCGGCTAT ATGGATCCAA AGCTAGATGC AGACCGCTTT ATCCAACTGA 15660 

CAGTCAACGG CAGCGAGCAA GTCAGGGCAT ATCGCACCGG CGACCGTGTG CGATACCGAC 15720 

CAAAGGACTT CCAGATCGAG TTCTTCGGTC GTATGGACCA GCAAATCAAG ATCCGCGGCC 15780 

ACCGTATCGA GCCGGCCGAG GTAGAGCAGG CCTTCCTGAA TGATGGCTTC GTCGAGGACG 15840 

TTGCTATCGT TATTCGGACC CCAGAGAACC AAGAGCCTGA GATGGTCGCC TTTGTTACTG 15900

CTAAGGGCGA CAACTCCGCG AGAGAAGAAG AGGCTACAAC CCAGATCGAA GGTTGGGAGG 15960 

CGCATTTCGA GGGTGGTGCG TACGCCAACA TCGAGGAGAT CGAAAGCGAG GCGCTTGGTT 16020 

ACGACTTTAT GGGCTGGACG TCTATGTACG ATGGCACTGA GATCGACAAG GACGAGATGA 16080 

GAGAGTGGCT GAATGACACG ATGCGCTCTC TCCTCGATGG AAAGCCGGCT GGTCGAGTTC 16140 

TTGAGGTCGG TACCGGTACC GGTATGATCA TGTTTAACCT TGGCAGGTCA CAAGGGCTCG 16200 

AAAGGTATAT TGGCCTCGAA CCTGCACCGT CGGCAGCCGA GTTCGTCAAC AACGCTGCAA 16260 

AGTCATTCCC GGGTCTCGCG GGCAGGGCCG AAGTTCACGT CGGCACCGCC GCAGATGTCG 16320 
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G7ACCCTGCA 
CAACGCCAGA 
GCATATACCT 
CCGCCTATTC 
TGGAGGAGAA 
AGCTGCAAGA 
AGCTAAGCTC 
ACAAGATTGA 
TTGCACAAGT 
ATAGCAAGAC 
CCCCTGAGGA 
AGTGCCACAC 
TTGAGGTCAG 
ACCTGAAGCC 
GCCGGCCGCT 
AGCTCTTGAT 
TCACGCTTAT 
CCCGTCGGGC 
ACA3CGTCGA 
TAACAGACAA 
GTCTAGGCCG 
TTGCTGATCT 
CAGACTACAC 
AACTTAATGT 
TCCGGGTTGA 
GTACGACATT 
AGGATCTGAG 
AGCATGAGCA 
TCAGGCTTGG 
GCTGGTCA.GT 
AACAGGATCC 
GGCAAAAGAC 
ACCTTGCCGA 
CCGGCCGCGC 



AGGCCTGACC 
GTACCTGGCC 
CGGCGATATG 
ACTGGCCGAT 
GGAGGAAGAA 
CAGGATCCAA 
GTACCGATAT 
TCCCGAAGCT 
ACTCAAGGAG 
CGTTGTAGAA 
ATCGATGGAT 
TCTCTCCGCA 
CTGGGCCCGT 
CGCTACGGAG 
CAAGAGCTTG 
CCGCGAGGGG 
CGACCGGATG 
CAAAATCACA 
GGCTACGGTC 
TTTCTTCAAT 
TCAACTCAAT 
CGCCGCTGTG 
AGGGCCCGTT 
CGGGGCGACG 
TGCCCTCGCT 
CCACGAGGAA 
AATCATCGAC 
GACCACGCTC 
AGAAGAAGAG 
AGAGGTTCTG 
TATGGAGCAA 
7GAAGAGCAG 
CAGTACCCCT 
CAATGAGCTA 



TCAGATATGG 
GAGACGATCA 
CGGTCCTGGG 
AATGCCAGCA 
TTACTCGTTG 
CACGTGGAGA 
GCCGCCGTGC 
TGGATCAACT 
AATGAGAACG 
CGTCACATTG 
GGCAGCGACT 
AGTGACCTGT 
CAACATTCGC 
GACAGTCGCG 
ACGAATCAAC 
CTGCAAACCC 
CCACTCAATG 
CAGAAGAGCA 
TGCAAGGGCT 
CTGGGCGGTC 
ACACGTATCT 
ATCCAACGCA 
CCCCAGTCAT 
TGGTATCTTA 
ACCGCGATAT 
GATGGCGTCG 
CTGTCCACTC 
TTCGACCTCG 
CATATACTTT 
TTTGATGAGA 
ATCTTGCCTC 
GTTGCCGAGC 
GCGGAGCTGT 
CCCCTTACCA 



CCGTCATCAA 
AATCACTTGT 
CCATGAACAG 
AAGACCGCGT 
ACCCGGCCTT 
TCCTACCCAA 
TGCACATCTC 
TTGAGGGGTC 
CCGAGAGTGT 
TGCGGTCGCT 
GGATCTCGGC 
TCGACATTGC 
AGCACGGTGC 
TTTTGATCAA 
CGCTCCTGCC 
TGCTGCCTCC 
CTAACGGCAA 
AGCCGGTTGA 
TCACTGACGT 
ATTCACTGAT 
CGGTGAGAGA 
ACTCGGCACC 
TCGCACAGGG 
TGCCCCTTGG 
CAGCCCTGGA 
GTGTTCAAGT 
AGCCAAAGGA 
CAACCGAGCC 
CCATTGTTAT 
TGCACCGGTT 
TACCGATCCA 
ATCAGCGGCA 
TAACTGACCT 
TCGAGGGGCG 



CTCGGTGGCG 
CCAAGTCCCG 
GGACTTTGCT 
GAGACAGAAG 
CTTCACTGCC 
GCGAATGAAG 
CGACGAGCCC 
TCGATTGACC 
GGCCATCAGC 
TGACCAGGAA 
CGTGCGCACA 
AGAAGATGCT 
CCTGGATGCC 
GTTCCCTACA 
AGCCCAGAGT 
CTACATGATC 
AGTCGACCGG 
GGACATCGTT 
GCTGGGCGTT 
GGCAACGAAG 
CGTCTTCGAT 
TCAC3AGCCG 
CCGCCTTTGG 
TATCCGCCTT 
GCAACGTCAC 
CGTGCAAGAC 
CGCCTATCTT 
CGGTTGGCGC 
GCACCATATC 
CTACTCGAGT 
GTACCGCGAC 
GTTGGACTAC 
CCCTCGACCT 
TCTTCATGAT 



CAGTACTTCC 
GGCATGAAGC 
GCCGCTCGCG 
ATGATGGAAT 
TTGGCGAGCC 
GCTACAAATG 
CTTCCTATCT 
CGAGAGGCGC 
AACATTCCTT 
GACGCCAATG 
AGAGCTCAGC 
GGGTTCCGTG 
GTGTTCCACC 
GATCACCAAG 
CGCCGAGCCG 
CCCTCGCAAA 
AGAGAACTCG 
CCCCCTCGGA 
GAAGTTGGGA 
TTGGCGGCGC 
CAACCAGTCG 
ATCAAGCCAG 
TTCCTCGATC 
CACGGATCTC 
GAGCCTCTCC 
CACCGGCCCA 
GCCGTGTTGA 
GTCGCTCTGA 
ATATCTGACG 
GCGCTTAGGC 
TTTGCAGCAT 
TGGACGGAGC 
TCTATCTTGT 
AAATTGCGCG 



16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940

18000 

18060 

18120 

18180

18240 

18300 

18360 
CTTTCTGCCG


AGTTCACCAA 


GCCACGCCGT 


TCGTCATTCT 


CCTTGCGGCC 


CTGAGGGCAG 


18420 




CACATTACCG 


TCTTACAGGG 


GCTGAAGATG 


CCACGCTTGG 


AACGCCAATT 


GCTAACCGTA 


18480 




ACCGACCGGA 


ACTTGAGAAC 


ATGATCGGCT 


TCTTTGTTAA 


CACGCAGTGT 


ATGCGTATTG 


18540 




CTATTGAGGA 


GAATGATAAC 


TTCGAGTCCC 


TCGTCCGCCG 


AGTACGATCG 


ACTGCAACAT 


18600 




CCGCTTTCGC 


GAACCAAGAC 


GTTCCATTTG 


AATCCATTGT 


CTCGTCTCTT 


CTGCCTGGAT 


18660 




CCAGGGACGC 


CTCGCGCAAT 


CCATTAGTGC


AGGTCATTCT 


GGCTGTTCAC 


TCTCAGCAGG 


18720 




ATTTGGGTAA 


ACTCACCTTA 


GAAGGCCTCA 


GAGATGAAGC 


TGTTGACTCG 


GCTATCTCAA 


18780 




CCCGCTTCGA 


TGTGGAGTTT 


CATCTTTTCG 


AGCACGCAGA 


CAGGCTCAGC 


GGTAGCGTGC 


18840 




TTTACGCAAA 


GGAACTGTTC 


AAGCTGCGCA 


CGATCGAATC 


AGTGGTCTCC 


GTTTTCCTGG 


18900 




AGACTCTGCG 


GCGGGCTCTC 


GACCAGCCAC 


TGACTCCTCT 


CGCTGTCTTG 


CCGCTGACTG 


18960 




ATGGAGTCGG 


GGAGATCGCA 


AGCAAGGGGC 


TCCTTGATGT 


GCCCAGGACA 


GACTATCCAC 


19020 


GAGATGCAAA 


TATCGTTGAG 


GTCTTCCAAC 


AGCACGTTCG 


CGCTACCCCG 


GACGCCATCG 


19080 




CCGTCAAGGA 


CGCTACTTCC 


ATACTGACGT 


ATGCTCAGCT 


AGATCAGCAG 


TCTGATCGAC 


19140 




TTGCTATCTG 


GTTGAGTCGC 


CGGCACATGA 


TGCCCGAAAC 


GCTGGTGGGT 


GTCCTTGCGC 


19200 




CGCGGTCATG 


CGAGACCATT 


ATCGCAATGT 


TTGGCATTAT 


GAAGGCCAAC 


CTCGCCTACT 


19260 




TGCCTTTGGA 


TATAAACTCG 


CCTGCTGCTC 


GACTCCGCAG 


CATTCTCTCA 


GCCGTAGATG 


19320 




GGAACAAGCT 


TGTTTTGCTC 


GGCAGTGGTG 


TCACAGCCCC 


CGAGCAAGAG 


AACCCCGAGG 


19380 




TGGAAGCTGT 


TGGTATTCAA 


GAGATCTTGG 


CCGGCACTGG 


ACTGGACAAG 


ACACAAGGCA 


19440 




GCAACGCCCG


ACCCTCGGCA 


ACGAGCCTTG 


CTTATGTTAT 


CTTCACCTCT 


GGTTCAACCG 


19500 




GCAAGCCCAA 


GGGCGTCATG 


GTCGAACATC 


GTAGCGTTAC 


GAGATTGGCA 


AAGCCCAGCA 


19560 




ACGTTATCTC 


CAAGCTACCA 


CAAGGAGCCA 


GGGTGGCGCA 


CCTCGCCAAC 


ATTGCCTTCG 


19620 




ATGCCTCGAT 


CTGGGAAATT 


GCCACAACTC 


TTCTGAATGG 


AGCCACGCTT 


GTTTGTCTCG 


19680 




ACTATCACAC 


CGTTCTCGAC 


TGCAGGACTC 


TCAAAGAAGT 


CTTCGAAAGG 


GAAAGCATTA 


19740 




CGGTTGTCAC 


ACTGATGCCT 


GCGCTCCTCA 


AGCAGTGCGT 


GGCCGAAATA 


CCCGAGACCC 


19800 


TCGCACACCT 


CGACCTCCTG 


TACACCGGTG 


GAGATCGAGT 


GGGTGGTCAC 


GATGCTATGC 


19860 




GGGCTCGCTC 


GCTAGTCAAG 


ATCGGCATGT 


TCAGCGGTTA 


CGGCCCTACG 


GAGAACACCG 


19920 




TCATCAGCAC 


CATCTACGAA 


GTTGATGCAG


ACGAGATGTT 


TGTGAATGGT 


GTGCCTATCG 


19980 




GCAAGACTGT 


AAGCAACTCT 


GGGGCATATG 


TTATGGACAG 


GAATCAGCAG 


CTGGTGCCTA 


20040 




GTGGCGTGGT 


AGGTGAGCTT 


GTGGTCACTG 


GCGATGGCCT 


TGCTCGCGGA 


TACACTGATC 


20100 




CATCCCTAAA 


CAAGAACCGC 


TTCATTTACA 


TCACTGTCAA 


TGGAGAGAGT 


ATCAGGGCAT 


20160 




ATCGGACTGG 


CGATCGGGTG 


AGGTACCGGC 


CTCATGATCT 


GCAGATTGAA 


TTCTTTGGCC 


20220 




GCATGGACCA 


GCAGGTCAAG 


ATCCGTGGCC 


ATCGAATCGA 


GCCGGGAGAG 


GTGGAGAGCG 


20280 




CATTGCTCAG 


TCACAACTCG 


GTACAAGACG 


CCGCGGTCGT 


CATTTGCGCG 


CCAGCAGATC 


20340 
AAGACTCAGG
CCCAGGAGGA 
CATATTCAGA 
CTTCTATGTA 
CCATGCGTAT 
CCGGCATGGT 
AGCCTTCAAA 
AGGATGGCCG 
TTCAACCACG 
TCTTCAGGGT 
ACATGAGAAC 
GCGAGAAGGC 
AGGAACTTCT 
TCAAGCACGT 
GATATGCCGC 
CTCCCAACGC 
TGCTGAAGGA 
CCATTGTTGA 
ACTCACTGGA 
CACTCGATGC 
GTTGGGCGCG 
CGCCCAAGGA 
TGAACACCTT 
TCCGCGAGAA 
TTGATCAGAT 
CTATCGTGGC 
AGGCTATTCT 
ACTTCTTTGA 
GCCGCCTTGA 
TTGCGGCGTC 
CCGGGCCGGC 
TTGGCGCGAC 
CTGCGTTGTC 
TTGAAGAAAG 



CGCGGAAATG 
AGAAGCAGTC 
AGTCAAGGAC 
TGACGGCAGC 
GATACTCGAC 
CATGTTCAAC 
GTCGGCAGCC 
GTCAATAGTC 
CCTCGTCGTT 
TGTGGAGGCC 
CAACGCCATC 
AAACAAGCGC 
GACGGACCCT 
CGAAATTCTC 
AGTACTACAT 
CTGGATAGAC 
GCACAAGGAT 
GCGGTTTGTC 
CGGATCAGCT 
AATGGATGTC 
TCAATGGTCC 
GGGTGCTCGC 
AACGAACCGT 
GCTGCAGACC 
GCCTGTCAAC 
CCCGAAGCCA 
GAGAGACGAA 
TCTCGGCGGG 
TGCCCATATT 
CATCCAGAGA 
TGAACAGTCA 
CTGGTACCTG 
TGCCGCACTC 
CGACGGCGTT 



GTGGCATTCG 
GATCAGGTTC 
ATTCGACAGT 
GAGATCGACA 
GCCAGAGAGC 
CTTGCCAAGT 
CAATTCGTCA 
CATGTGGGCA 
ATCAACTCAG 
CTTGTACAGA 
AACAGAGACT 
CTGGTCCGCC 
GCATTCTTTA 
CCCAAGACCA 
GTGCGTGGCT 
TTTGCGGCAG 
GCCGGGACCG 
AACAAGTCAC 
TGGGTTGCAG 
AAGGAGATTG 
CAGAATGGTG 
ACACTTATTG 
CCCCTGAACA 
CTCCTGCCGC 
AACAACGGCA 
AGGTCAGCGG 
TTCGAGGACG 
CACTCACTTA 
TCCATCAAAG 
GAATCGGCTC 
TTTGCCCAAG 
ATGCCTTTAG 
TTCGCCTTGG
GGCGTGCAAA 



TTGCCGCCCG 
AAGGGTGGGA 
CAGAAGTCGG 
AGACAGATAT 
CGGGCCACGT 
GTCCTGGTCT 
ATGATGCAGC 
CGGCGACAGA 
TAGCGCAGTA 
TCCCAAGCGT 
TCGTCGCAAG 
AGATGATCTA 
CATCTTTGCG 
TGAAGGCTAC 
CGAGAGAACA 
ACGGTCTCGA 
TCGCTATCGG 
TGAGCGAGGA 
CCGTCCGGAT 
CTCAGGAGGC 
CGCTCGATGC 
AGTTCCCGAC 
GCATTCAAAG 
CTTACATGAT 
AGATTGACCG 
CTACTCGGGT 
TGCTCGGAAC 
TGGCCACGAA 
ATGTCTTTGA 
CTCATGAACC 
GTCGCCTATG 
CCATGCGTAT 
AGAGACGACA 
TTGTTGGAGA 



GAATACCGAA 
GACGCACTTC 
TAACGACTTC 
GCACGAGTGG 
ACTGGAGATC 
GCAGGGCTAC 
CCAGTCATTC 
CATCAACAAG 
TTTCCCCACG 
GGAACGCATC 
CCGAGCATTG 
TGAGCTCGAA 
TACGCGCTTG 
CAACGAGCTC 
ATCAACTATA 
CCGGCAGACC 
TAATATCCCG 
TGATATGGAG 
GGCCGCTCAA 
GGGATACCAG 
CATCTTCCAT 
GGATTACGAA 
CCGCCGTCTT 
CCCATCGCGC 
CAAGGAGCTT 
AGCCCCCCGC 
AGAAGTCAGC 
GCTCGCCGCC 
TCAGCCGGTG 
GATTCCGCAA 
GTTCCTTGAC 
CCGTGGCCAG 
TGAGACCTTG 
GGCTCGCAAC 



GACGAAGACA 
GAAACGGCCG 
ATGGGCTGGA 
CTCAACGACA 
GGTACCGGCA 
GTCGGTTTCG 
CCGGCTCTGA 
GCTGGGCCGA 
CCAGAGTACC 
GTCTTTGGTG 
CACACCCTCG 
GCCAACGAAG 
GGTGAGAAGA 
AGCAAGTACC 
CACCAAGTCT 
CTCATCAACT 
TACAGCAAGA 
GAAGGCCAGA 
AGCTGCCCAT 
GTCGAAGTCA 
CACTTCGAAC 
GGCCGGAATG 
GGGACGCAGA 
ATCATGGTCC 
GTGCGGAGAG 
AATGAGATCG 
GTGCTGGATA 
CGCGTTAGCC 
CTGGCGGATC 
AGGCCTTACA 
CAGCTTAACC 
TTGAGGGTAG 
AGAACCACCT 
TCAGACCTTC 



20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 
GGGTTCATGA


CGTTTCTACC 


GGAGACGACG 


GGGAGTACCT 


TGAGGTACTC 


AGGAGGGAAC 


22440 




AGACTGTGCC 


CTTCGACCTC 


TCGTCAGAGC 


CTGGCTGGAG 


GGTTTGCCTG 


GTCAAAACGG 


22500 




GCGAAGAGGA 


TCATGTACTA 


TCCATCGTCA 


TGCACCATAT 


TATTTACGAC 


GGCTGGTCCG 


22560 




TGGACATTCT 


CCGCGGAGAG 


TTGGGTCAAT 


TCTATTCCGC 


TGCCCTACGC 


GGCCAGGACC 


22620 




CCTTGCTGCA 


CGCCAACCCC 


CTTCCTATTC 


AATACCGGGA 


TTTCGCAGCG 


TGGCAAAGGG 


22680 


AGGCAAAACA 


GGTCGAAGAG 


CATCAACGTC 


AACTTGGGTA 


CTGGTCGAAA 


CAGCTCGTTG 


22740 




ACAGCACTCC 


AGCTGAGCTC 


TTGACGGATC 


TGCCTCGCCC 


GTCTATCTTG 


TCCGGTCGTG 


22800 




CCGGGTCGGT 


GGATGTCACG 


ATCGAAGGCT 


CTGTTTACGG 


AGCCCTTCAG 


TCATTCTGCC 


22860 




GCACGCGTTC 


GGTAACCACA 


TTCGTTGTGC 


TTCTGACTGT 


GTTCCGGATT 


GCGCATTTCC 


22920 




GTCTCACTGC 


CGTCGATGAC 


GCGACTATCG 


GCACGCCTAT 


CGCAAACCGT 


AACCGTCCTG 


22980 




AGCTGGAGAC 


GTTGGTTGGC 


TGCTTTGTAA 


ACACGCAATG 


TATGCGTATC 


AGCATAGCCG 


23040 




ACGACGATAA 


CTTTGAAGGT 


CTTGTGCGAC 


AGGTGCGTAA 


TGTTGCAACG 


GCAGCTTACG 


23100 




CGAACCAAGA 


TGTTCCTTTC 


GAACGAATCG 


TGTCCGCCCT 


AGTTCCAGGG 


TCGAGAAACA 


23160 




CATCCCGCAA 


CCCCCTGGTT 


CAGCTCATGT 


TTGCTGTCCA 


GTCCGTGGAA 


GATTATGACC 


23220 




AGGTCCGACT 


CGAGGGCTTG 


GAGAGTGTCA 


TGATGCCTGG 


AGAAGCCTCC 


ACACGCTTTG 


23280 


ATATGGAATT 


CCACCTCGTC 


CCCGGCGATC 


AGAAGCTTAC 


GGGCAGCGTT 


CTTTACTCCT 


23340 




CAGACCTTTT 


TGAGCAAGGC 


ACTATCCAGA 


ACTTCGTCGA 


CATCTTCCAA 


GAATGTCTTC 


23400 




GCTCCGTCCT 


GGACCAGCCA 


TTGACCCCGA 


TCTCCGTTCT 


TCCCTTCAGC 


AACGCCATTT 


23460 




CAAACCTCGA 


GAGCTTGGAT 


CTCCTGGAGA 


TGCCGACCTC 


AGACTACCCC 


CGCGATCGGA 


23520 




CAGTCGTTGA 


TCTCTTCCGA 


GAGCAAGCGG 


CAATCTGCCC 


CGACAGCATC 


GCCGTCAAAG 


23580 




ACTCATCGTC 


GCAACTGACA 


TATGCTCAAC 


TGGATGAGCA 


ATCCGACCGT 


GTTGCCGCCT 


23640 




GGCTGCACGA 


GCGCCACATG 


CCGGCGGAGT 


CTTTGGTCGG 


TGTACTGTCG 


CCACGGTCGT 


23700 




GCGAGACTA? 


CATCGCGTAC 


TTTGGCATCA 


TGAAGGCAAA 


CCTGGCTTAC 


CTGCCGTTGG 


23760 




ATGTTTATGC 


GCCAGATGCC 


CGTCTGGCGG 


CTATCCTGGA 


TACAGTCGAA 


GGCGAAAGAC 


23820 




TGCTTCTGTT 


GGGCGCAGGT 


GTCCCTCAGC 


CCGGCATCCA 


GATCCCTCGC 


CTGTCAACAG 


23880 




CATACATCGC 


GGAAGCACTG 


AGCCATGCCA 


CGACCGTCGA


TGTCACTTCC 


ATCCCACAGC 


23940 




CCTCGGCCAC 


CAGCCTTGCG 


TACGTCATTT 


TCACTTCGGG 


ATCTACTGGC 


AAGCCCAAGG 


24000 




GTGTCATGAT 


CGAGCATCGC 


GGCATCGTGC 


GCCTGGTTAG 


AGATACCAAC 


GTCAACGTGT 


24060 


TCCCGGAATC 


GGGATCAGCT 


TTGCCTGTCT 


CTCACTTCTC 


CAACCTCGCC 


TGGGATGCGG 


24120 




CGACTTGGGA 


GATCTACACT 


GCCGTGCTCA 


ATGGAGGGAC 


CGTTGTGTGC 


ATTGACCGAG 


24180 




ACACCATGCT 


GGACATAGCC 


GCGTTGAACT 


CAACATTCCG 


GAAGGAGAAC 


GTTCGGGCTG 


24240 




CCTTCTTCAC 


CCCTGCCTTC 


CTGAAGCAAT 


GCCTTGCCGA


GACGCCAGAG 


CTGGTCGCCA 


24300 




ACCTAGAGAT 


CCTTCACACG 


GCAGGCGATC 


GTCTCGATCC 


TGGAGATGCC 


AACCTGGCTG 


24360 
GAAAGACAGC CAAGGGTGGT ATCTTCAACG TCCTGGGTCA CACAGAGAAC ACTGCCTATA 24420

GTACCTTCTA CCCTGTGGTT GGTGAGGAGA CGTTCGTCAA TGGTGTCCCC GTCGGTCGCG 24480 

GCATCAGCAA CTCCCATGCA TATATCATCG ACCGACACCA GAAGCTCGTA CCCGCAGGTG 24540 

TCATGGGAGA GCTTATTCTC ACTGGCGACG GTGTTGCGCG AGGTTACACC GACTCTGCGC 24600 

TGAACAAGGA TCGATTCGTT TACATCGATA TCAACGGCAA AAGCACATGG TCGTACCGCA 24660 

CAGGCGATAA GGCACGTTAT CGACCAAGGG ACGGCCAGCT GGAATTCTTT GGCCGCATGG 24720 

ACCAAATGGT CAAGATCCGT GGTGTTCGAA TCGAACCCGG CGAAGTTGAG CTCACCCTGC 24780

TCGACCATAA GTCCGTCCTG GCCGCGACTG TGGTGGTCAG AAGACCACCC AATGGCGACC 24840 

CGGAGATGAT TGCCTTCATC ACCATCGACG CTGAAGACGA CGTGCAAACT CACAAGGCCA 24900 

TTTACAAGCA CCTCCAGGGT ATCTTGCCCG CGTACATGAT TCCCTCACAC CTTGTCATCC 24960 

TTGACCAGAT GCCGGTCACC GACAACGGTA AGGTCGATCG CAAGGATCTC GCACTCAGAG 25020

CGCAGACAGT ACAGAAACGC AGGTCTACCG CTGCAAGGGT ACCACCTCGT GACGAGGTGG 25080

AGGCTGTTCT TTGCGAAGAG TACAGCAACT TACTTGAAGT TGAGGTTGGC ATTACCGACG 25140

GATTCTTCGA CCTGGGTGGA CATTCGCTCC TCGCCACCAA GCTTGCGGCC CGCCTAAGCC 25200 

GACAACTCAA CACTCGCGTG TCTGTCAAGG ACGTCTTTGA CCAGCCAATA CTCGCTGACC 25260

TCGCTGATAT CATCCGCCGC GGTTCCCATC GCCACGATCC GATTCCTGCC ACTCCATACA 25320 

CGGGCCCTGT CGAACAGTCG TTCGCTCAGG GCCGCCTGTG GTTCTTGGAA CAACTGAACC 25380 

TAGGTGCCAG CTGGTACTTG ATGCCCTTCG CGATCCGGAT GCGTGGGCCC CTCCAGACAA 25440 

AGGCGCTGGC TGTCGCACTG AACGCCTTGG TGCACCGGCA CGAGGCGTTG CGGACGACTT 25500 

TCGAGGACCA CGATGGGGTT GGTGTTCAGG TCATTCAACC AAAGTCAAGC CAAGACCTGC 25560 

GGATCATCGA CCTATCAGAC GCTGTAGATG ATACTGCCTA TCTCGCCGCG CTCAAGAGGG 25620 

AACAGACAAC AGCCTTCGAC CTGACCTCTG AACCAGGGTG GAGAGTGTCA CTCTTACGCC 25680 

TAGGTGACGA TGATTACATC CTTTCTATCG TTATGCACCA CATTATCTCT GATGGCTGGA 25740

CTGTTGATGT GCTACGACAA GAACTCGGCC AGTTCTATTC AGCTGCGATC AGGGGTCAGG 25800 

AGCCTTTATC GCAGGCCAAG TCCCTCCCTA TTCAATACCG CGACTTTGCT GTTTGGCAGA 25860 

GGCAGGAGAA CCAGATCAAG GAGCAAGCGA AGCAGCTCAA GTATTGGTCA CAGCAGCTCG 25920 

CAGATAGCAC CCCCTGCGAG TTCCTAACGG ACCTCCCTCG GCCCTCTATC CTGTCTGGTG 25980 

AAGCTGACGC CGTTCCTATG GTGATTGATG GCACGGTGTA TCAGCTCCTT ACTGATTTCT 26040 

GCCGGACGCA CCAAGTCACA TCGTTCTCAG TCCTGCTCGC AGCCTTCCGC ACTGCCCACT 26100 

ACCGCCTTAC CGGGACACTC GACGCGACGG TTGGCACACC AATCGCTAAC CGGAACCGGC 26160 

CAGAGTTGGA AGGTCTGATC GGTTTCTTCG TTAACACGCA GTGTATGAGG ATGGCAATCA 26220

GTGAGACTGA AACCTTTGAG TCACTAGTCC AGCAGGTTCG CTTGACTACG ACAGAAGCCT 26280

TTGCGAACCA AGATGTGCCG TTTGAGCAGA TTGTGTCAAC CCTTCTTCCT GGGTCACGAG 26340

ATACGTCAAG GAACCCGCTT GTGCAGGTCA TGTTTGCCCT GCAATCACAG CAAGACCTCG 26400
GAAGAATCCA


GCTGGAAGGT 


ATGACGGACG 


AAGCTCTGGA 


AACGCCGCTG 


TCGACGAGAC 


26460 




TCGACCTTGA 


GGTTCACCTC 


TTCCAGGAGG 


TTGGAAAGCT 


GAGCGGCAGC 


CTCTTGTACT 


26520 




CCACGGACCT 


CTTCGAGGTC 


GAGACGATTC 


GTGGAATCGT 


TGATGTGTTC 


CTGGAGATCT 


26580 




TGCGCCGCGG 


CCTTGAGCAA 


CCCAAGCAGC 


GACTGATGGC 


CATGCCAATT 


ACCGATGGCA 


26640 




TCACAAAGCT 


ACGCGACCAG 


GGTCTCCTAA 


CAGTGGCGAA 


ACCAGCCTAC 


CCTCGCGAAT 


26700 




CGAGTGTCAT 


AGATCTGTTC 


AGACAGCAGG 


TTGCCGCCGC 


ACCGGATGCC 


ATCGCTGTGT 


26760 




GGGATTCCTC 


CTCAACATTG 


ACCTATGCCG 


ACCTCGATGG 


GCAATCGAAC 


AAGCTCGCCC 


26820 




ACTGGCTGTG 


CCAGCGCAAT 


ATGGCCCCAG 


AGACCTTGGT 


AGCTGTATTC 


GCGCCACGCT 


26880 




CATGCCTCAC 


CATCGTCGCA 


TTCCTCGGTG 


TTTTGAAGGC 


TAATCTGGCC 


TACCTGCCCT 


26940 




TGGATGTCAA 


TGCGCCTGCT 


GCTCGTATCG 


AGGCTATCCT 


GTCAGCAGTA 


CCAGGCCACA 


27000 




AGCTGGTCCT 


GGTGCAGGCT 


CATGGGCCCG 


AGCTTGGCCT 


GACGATGGCT 


GATACTGAAC 


27060 


TGGTGCAGAT 


CGACGAGGCA 


CTTGCATCCA 


GTTCATCCGG 


TGACCATGAG 


CAGATCCATG 


27120 




CGTCCGGCCC 


TACTGCCACA 


AGTCTTGCCT 


ACGTGATGTT 


TACGTCAGGG 


TCTACTGGGA 


27180 




AACCAAAGGG 


TGTCATGATC 


GACCACCGCA 


GCATCATTCG 


ACTTGTCAAG 


AACAGCGATG 


27240 




TTGTTGCCAC 


TCTGCCTACG 


CCAGTCCGGA 


TGGCGAATGT 


ATCAAACCTT 


GCCTTCGACA 


27300 




TCTCGGTGCA 


AGAAATCTAC 


ACGGCGCTCC 


TAAACGGTGG 


CACTCTGGTC 


TGCTTGGACT 


27360 




ATCTGACGCT 


ATTGGACAGC 


AAAATTCTTT 


ATAACGTTTT 


TGTGGAAGCA 


CAGGTCAACG 


27420 




CCGCCATGTT 


CACGCCGGTT 


CTCCTCAAGC 


AATGTCTTGG 


AAACATGCCC 


GCCATCATCA 


27480 




GTCGCCTGAG 


TGTTCTCTTT 


AACGTTGGTG 


ACAGGCTGGA 


TGCCCACGAT 


GCTGTGGCTG 


27540 




CATCAGGCCT 


GATCCAAGAC 


GCCGTATACA 


ACGCCTACGG 


TCCCACGGAG 


AACGGCATGC 


27600 




AGAGTACGAT 


GTACAAGGTC 


GACGTCAATG 


AGCCTTTCGT 


CAACGGCGTC 


CCGATCGGTC 


27660 




GATCCATCAC 


CAACTCTGGG 


GCTTACGTCA 


TGGACGGCAA 


TCAACAGCTC 


GTATCTCCTG 


27720 




GTGTGATGGG 


AGAAATTGTC 


GTTACCGGTG 


ATGGTCTTGC 


CCGTGGCTAT 


ACAGACTCAG 


27780 




CCCTAGACGA 


GGACCGGTTT 


GTTCACGTCA 


CGATCGATGG 


TGAGGAAAAT 


ATCAAGGCAT 


27840 


ACCGAACCGG 


TGATCGAGTC 


CGCTACCGGC 


CCAAGGACTT 


TGAGATTGAA 


TTCTTCGGCC 


27900 




GTATGGATCA 


ACAGGTGAAG 


ATTCGTGGTC 


ACCGCATTGA 


GCCAGCAGAA 


GTGGAACATG 


27960 




CACTGCTCGG 


CCACGACTTG 


GTTCACGATG 


CAGCTGTCGT 


GCTTCGAAAG 


CCAGCAAATC 


28020 




AAGAACCAGA 


GATGATTGCT 


TTCATCACCA 


GCCAGGAAGA 


CGAGACTATC 


GAGCAGCATG 


28080 




AGTCAAACAA 


GCAGGTCCAA 


GGCTGGGGAG 


AGCATTTCGA 


CGTAAGCAGG 


TATGCTGATA 


28140




TCAAGGATCT 


CGACACTTCT 


ACCTTTGGTC 


ACGACTTTTT 


GGGATGGACA 


TCTATGTATG 


28200




ACGGAGTTGA 


CATTCCTGTC 


AACGAGATGA 


AAGAGTGGCT 


TGATGAAACT 


ACGGCCTCCC 


28260 




TCCTAGACAA 


CCGCCCACCT 


GGTCATATCC 


TCGAGATCGG 


AGCCGGAACT 


GGCATGATTC 


28320 




TATCTAACCT 


GGGCAAAGTC 


GACGGCCTAC 


AGAAGTATGT 


CGGTCTTGAC 


CCGGCTCCCT 


28380 
CAGCCGCAAT


CTTTGTCAAC 


GAAGCCGTCA 


AGTCTCTGCC 


AAGTCTAGCC 


GGTAAGGCCC 


28440 




GGGTACTTGT 


TGGAACTGCC 


CTGGATATCG 


GTTCTCTGGA 


CAAGAATGAG 


ATCCAACCTG 


28500 




AGCTTGTGGT 


TATCAACTCC 


GTGGCCCAGT 


ACTTCCCCAC 


ATCAGAGTAC 


TTGATCAAGG 


28560 




TGGTCAAAGC 


TGTTGTGGAA 


GTGCCCAGCG 


TCAAGCGTGT 


TTTCTTTGGC 


GATATCAGAT 


28620 




CCCAGGCCCT 


TAACAGGGAC 


TTCCTTGCAG 


CTCGTGCCGT 


TCGTGCGTTG 


GGTGACAATG 


28680 


CTAGCAAAGA 


GCAGATCCGG 


GAAAAGATCG 


CAGAGCTCGA 


AGAGAGCGAA 


GAAGAACTTC 


28740 




TCGTGGACCC 


AGCCTTCTTC 


GTGAGCTTGA 


GAAGCCAGCT 


GCCCAACATC 


AAGCACGTTG 


28800 




AGGTCCTGCC 


CAAGCTGATG 


AAGGCCACCA 


ACGAGCTGAG 


CTCGTACAGA 


TATGCTGCGG 


28860 




TTCTACACAT 


CAGCCACAAC 


GAAGAGGAGC 


AGCTGCTCAT 


ACAGGATATC 


GATCCCACAG 


28920 




CATGGGTTGA 


CTTTGCAGCA 


ACGCAAAAGG 


ACTCTCAAGG 


TCTGAGAAAC 


CTTCTACAAC 


28980 




AAGGACGAGA 


TGATGTGATG 


ATCGCGGTCG 


GGAACATCCC 


GTACAGCAAG 


ACCATAGTGG 


29040 




AGCGACACAT 


TATGAACTCT 


CTTGACCAAG 


ATCACGTCAA 


CTCACTCGAC 


GGGACATCCT 


29100 




GGATCTCAGA 


TGCTCGATCA 


GCCGCTGCAA 


TCTGCACTTC 


GTTCGACGCA 


CCCGCCCTCA 


29160 




CGCAGTTGGC 


CAAGGAGGAG 


GGATTCCGGG 


TAGAGTTGAG 


CTGGGCGCGA 


CAGAGATCTC 


29220 




AAAACGGCGC 


CCTCGATGCC 


GTTTTCCACC 


GTCTTGCAAC 


CGATGCAAAT 


TGCGAGCGCA 


29280 




GTCGTGTCCT 


GGTACACTTC 


CCTACCGACC 


ATCAAGGTCG 


ACAACTTCGA 


ACCCTGACGA 


29340 




ACCGGCCACT 


CCAGCGAGCT 


CAGAGCCGCC 


GTATCGAGTC 


ACAAGTCTTC 


GAGGCACTGC 


29400 




AGACAGCACT 


GCCGGCCTAC 


ATGATCCCAT 


CGCGCATTAT 


CGTGCTCCCG 


CAGATGCCGA 


29460 


CCAACGCCAA 


CGGCAAAGTG 


GACAGGAAAC 


AGCTCGCTCG 


CCGCGCGCAG 


GTTGTGGCCA 


29520 




AGAGAAAGGC 


AGTGTCAGCG 


CGCGTTGCGC 


CTCGTAATGA 


CACCGAGATA 


GTTCTTTGCG 


29580 




AGGAATACGC 


AGATATCCTA 


GGAACTGAAG 


TTGGCATCAC 


GGACAACTTC 


TTCGACATGG 


29640 




GCGGGCATTC 


ACTCATGGCT 


ACGAAGCTCG 


CAGCCCGACT 


AAGCCGGCGA 


CTAGATACCC 


29700 




GGGTTACGGT 


CAAGGAGGTG 


TTCGATAAAC 


CCGTCCTGGC 


TGACCTCGCT 


GCTTCGATCG 


29760 




AACAGGGCTC 


GACACCTCAT 


CTGCCTATTG 


CCTCATCGGT 


GTATTCCGGA 


CCGGTGGAGC 


29820 




AGTCGTACGC 


CCAAGGTCGC 


TTGTGGTTCT 


TGGATCAGTT 


CAACCTCAAC 


GCGACGTGGT 


29880 




ATCACATGTC 


GCTGGCGATG 


AGGCTGCTCG 


GGCCGCTCAA 


CATGGACGCG 


CTGGACGTGG 


29940 




CCTTACGGGC 


GCTGGAGCAG 


CGACACGAGA 


CGCTTCGCAC 


AACCTTTGAG 


GCTCAAAAGG 


30000 




ACATCGGCGT 


CCAGGTCGTT 


CATGAGGCCG 


GAATGAAGAG


GCTCAAGGTC 


CTTGACCTAT 


30060 


CAGACAAGAA 


CGAGAAGGAG 


CACATGGCCG 


TGCTAGAGAA 


TGAACAGATG 


AGACCGTTCA 


30120 




CTCTTGCTTC 


AGAGCCAGGC 


TGGAAGGGTC 


ATCTTGCTCG 


CCTTGGCCCC 


ACGGAGTATA 


30180 




TCCTCTCCCT 


CGTCATGCAT 


CACATGTTCT 


CAGACGGCTG 


GTCCGTTGAT 


ATCCTGAGAC 


30240 




AGGAGCTCGG 


TCAATTCTAC 


TCAGCCGCTT 


TACGTGGCAG 


GGATCCGTTA 


TCTCAGGTCA 


30300 




AGCCCCTCCC 


AATACAATAT 


CGTGACTTTG 


CGGCTTGGCA 


GAAGGAAGCT 


GCCCAAGTTG 


30360 




CCGAGCATGA 


GAGGCAGCTC 


GCGTACTGGG 


AGAACCAGTT 


AGCTGACAGT 


ACTCCCGGTG 


30420 
A3CTTCTGAC


CGACTTTCCC


CGCCCACAGT 


TCCTGAGTGG 


GAAGGCTGGT 


GTCATCCCGG 


30480 




TCACCATTGA 


GGGGCCGGTC 


TACGAGAAGC 


TTCTGAAGTT 


C T CO AAGGAG 


CGCCAGG^AA 


30540 




CTCTGTTCTC 


GGTGCTATTA 


ACAGCGTTCC 


GGGCCACACA 




A p Tr^r; THP A r; 


30600




AGGATGCTAC 


GATCGGTACC 


CCAATTGCAA 


ATCGCAACCG 


GCCAGAAPTP 

wWWcXWXUlW X w 


fiAf^ATATp A 
wAwwn X A^ w 


"3 n fin 

o u c o u 




TTGGA^TCTT 


CGTCAACACC 


CAATGCATGC 


GTCTTCTCCT


p G A T Arr'nt^r* 


Af^PaPATTPf^ 
nO^A^AX Xwvj 






AATCCCTAGT 


CCAGCATGTT 


CGGTCCGTGG 


CTACAGATGC 


CTATTCfAAT 

W X «• ^ A W wf^Zl * 


PAfViATATTP 
wAwOA X A X X w 


30780




CCTTCGAACG 


GATCGTCTCG 


GCACTTCTCC 


CTGGCTCGAG


AGATttCCTPA 


Pf5AAf5PPPAP 


30840




TAATCCAGCT 


TATGTTTGCC 


TTGCACTCAC


AGCCAGATCT 


CGGG AAr* AT* T 

wwww^^nw^* X 


APTPTPrtA 

AV» X W X VitOAAU 






GACTCGAGCA 


TGAGCGCCTG 


CCAACAAGCG 


TCGCAACACG 


1* T*TCGACATG 


GAGTTCCACP 

X X WW^IWV^ 


30960




TGTTCCAAGA 


GCCTAACAAG 


CTGAGTGGTT 


CAATACTCTT 


TGCCGATGAG 




31020 




CTGAAACAAT 


CAACAGCGTC 


GTGACTGTGT 


TCCAGGAGAT 


ACTCCGACGC 


GGCCTCGACC 


31080 


AACCCCAAGT 


CTCCATTTCT 


ACTATGCCCC 


TGACTGATGG 


GTTGATTGAT 


CTCGAGAAAC


31140 




TGGGCTTGCT 


GGAAATCGAG 


AGCAGCAACT 


TCCCTCGCGA 


CTACTCGGTT 


GTCGACGTCT 


31200 




TCCGACAGCA 


GGTGGCTGCC 


AATCCAAATG 


CGCCCGCTGT 


CGTGGATTCG 


GAGACATPPA 


31260




TGAGCTACAC 


CTCGCTAGAT 


CAGAAGTCTG 


AGCAGATTGC 


TGCCTGGTTA 


CACGCTCAAG 


31320 




GCCTCCGCCC 


TGAGTCATTG 


ATCTGCGTGA 


TGGCGCCACG 


ATCTTTCGAA 


ACG AT PP TP T 

AW wA X Ww X W ^ 


31380




CCTTATTCGG 


TATCTTGAAG 


GCTGGCTACG 


CCTACCTGCC 


TCTGGATGTG 


AATTPPPPTfi 

AAX X ^ WWW X W 


31440




CAGCTCGAAT 


CCAACCGATC 


CTATCCGAGG


TTGAAGGAAA 


AAGAPTfSflTA 


W X WW X AwwA X 






CAGGGATAGA 


CATGCCTCAA 


AGCGACCGAA 


TGGAT6TTGA 


AACCGCTCGA 


ATTP Af^f^APA 

AX X^AOOAL^A 


31560




TCCTAACGAA 


CACAAAGGTC 


GAGAGATCTG


« ^ w w\^n ^ w^\w 


w**wO^V^ AX 


^^P A APT- A/n*^^ 
V9w AAV« X AO wW 






TTGCCTATGT 


CATCTTCACC


TCCGGGTCAA 


W A w\9W X WW 


W4WU9 WUWU ^ W 


A X w/\ X WljA.'iL7W 






ATCGCAATAT 


TCTGCGCCTT 


GTCAAGCAGT


CTAATGTTAC


w X w X wAw^w X w 


ppf^p Af'zrsa TP 


31740




TGCGCATGGC 


ACATATCTCC 


AACCTAGCCT


TTGACGCGTC 


^ A X W X ^JwwAkS 


A T 21 TTP A (^("^/^ 
aXAX XImxI^UU 


31800




CAATTTTGAA 


TGGCGGCGCC 


CTTATTTGCA 


TTGATTACTT 


^M««A^,^ XXX X 


GATAf^TP A Af^ 
WAX avjx.\^aaVj 






CCCTCCGGAC 


GACATTCGAA 


AAAGCCAGGG 


TCAATGCTAC 


CCTATTCGCG 


CCCCjCC'V Trip 
www www X Xw^ 


31920 










TGTTTGAGGA 


TCTCAAAGTG 


CTCTATATCG 


31980 




GTGGCGACCG 


ACTCGATGCC 


ACCGACGCGG 


CCAAAATACA 


AGCCCTTGTG 


AAGGGCACGG 


32040 




TCTACAACGC 


GTACGGGCCG 


ACAGAGAACA 


CAGTCATGAG 


CACGATCTAC 


AGGCTCACAG 


32100 


ATGGAGAGTC 


TTATGCTAAC 


GGTGTGCCAA 


TCGGCAATGC 


TGTGAGCAGC 


TCTGGCGCTT 


32160 




ATATCATGGA 


CCAAAAGCAG 


CGCCTCGTTC 


CTCCCGGTGT 


TATGGGAGAG 


CTCGTTGTGA 


32220 




GCGGCGATGG 


CCTCGCCCGT 


GGCTACACCA 


ACTCGACCCT 


CAATGCTGAT 


CGTTTCGTrG 


32280 




ATATTGTCAT 


CAACGATCAA 


AAAGCCCGCG 


CATACCGGAC 


CGGAGATCGC 


ACTCGTTACC 


32340 




GGCCCAAGGA 


TGGTAGCATC 


GAGTTCTTCG


GCCGTATGGA 


TCAGCAAGTT 


AAAATCCGTG 


32400 
GTCATCGAGT


TGAGCCGGCC 


GAGGTCGAGC 


AAGCCATGCT 


CGGCAATAAG 


GCTATCCATG 


32460 




ATGCAGCAGT 


TGTTGTTCAG 


GCGGTGGATG 


GCCAGGAAAC 


GGAGATGATC 


GGCTTTGTTT 


32520 




CCATGGCCAG 


CGACAGATTC 


AGCGAAGGGG 


AGGAGGAGAT 


CACCAACCAA 


GTCCAGGAGT 


32580 




GGGAAGACCA 


CTTCGAAAGC 


ACCGCCTACG 


CTGGCATTGA 


GGCCATCGAC 


CAGGCTACCC 


32640 




TGGGACGCGA 


TTTCACTTCA 


TGGACCTCGA 


TGTACAACGG 


CAACTTGATT 


GACAAAGCCG 


32700 




AAATGGAGGA 


GTGGCTTGAC 


GATACAATGC 


AATCCCTCCT 


TGATAAGGAG 


GATGCCAGGC 


32760 




CGTGTGCTGA 


GATCGGAACA 


GGTACCGGCA 


TGGTTCTATT 


CAATTTGCCC 


AAGAACGATG 


32820




GCCTTGAGAG 


CTATGTCGGT 


ATAGAGCCTT 


CACGGTCTGC 


AGCCTTGTTC 


GTCGACAAAG 


32880 


CAGCCCAAGA 


TTTCCCAGGT 


CTGCAAGGAA 


AGACGCAAAT 


CCTTGTCGGC 


ACAGCCGAGG 


32940 




ACATCAAGCT 


GGTCAAGGAC 


TTCCACCCTG 


ACGTGGTTGT 


CATTAACTCG 


GTAGCCCAAT 


33000 




ATTTCCCGAG 


CCGGAGCTAC 


CTTGTACAGA 


TAGCGAGCGA 


ACTGATTCAC 


ATGACCAGCG 


33060 


TCAAGACGAT 


CTTCTTTGGA 


GATATGCGAT 


CCTGGGCCAC 


CAACAGGGAT 


TTCCTCGTGT 


33120 




CCCGAGCTCT 


TTACACGCTA 


GGTGACAAGG 


CTACAAAGGA 


TCAGATTCGC 


CAGGAGGTTG 


33180 




CCCGACTTGA 


GGAGAATGAA 


GACGAGTTGC 


TTGTTGACCC 


AGCATTCTTC 


ACCTCTTTGA 


33240 




CCAGCCAATG 


GCCCGGCAAG 


GTCAAGCATG 


TTGAGATCTT 


GCCGAAGCGG 


ATGAGGACGA 


33300 




GCAATGAACT 


AAGCTCGTAC 


CGATATGCTG 


CGGTGCTACA


CATCTGCAGG 


GATGGGGAGG 


33360 




GTAGGAACAG 


ATAXGGCAGG 


CGTGTCCACT 


CAGTGGAAGA 


GAACGCCTGG 


ATCGACTTCG 


33420 




CGTCGTCTGG 


CATGGATCGT 


CACGCCCTCG 


TTCAGATGCT 


CGATGAACGT 


AGAGACGCCA 


33480 




AGACTGTCGC 


CATCGGCAAC 


ATCCCTCACA 


GCAACACGAT 


CAACGAGCGA 


CACTTTACGA 


33540 




CATCCCTGGA 


TACTGAGGGA 


GAAGGCATTG 


CCCAAGATTC 


ACTGGATGGA 


TCCGCCTGGC 


33600 




AATCGGCTAC 


GAAGGCAATG 


GCCGCGCGCT 


GTCCTTGCCT 


TTCCGTCACC 


GAACTGGTCG 


33660 


AGATCGGCCA 


AGCGGCAGGA 


TTCAGGGTCG 


AGGTCAGCTG 


GGCTCGTCAA 


CGATCCCAAC 


33720 




ATGGTGCACT 


GGACGTCGTC 


TTCCATCATC 


TTGAAGATGA 


CAGAGTAGGC 


CGCGTCTTGA 


33780 




TCAACTTCCC 


CACAGACTTC 


GAGCGTCTAC 


CCCCTAGCAC 


CGGCCTGACC 


AGTCGGCCGC 


33840 




TGCAGCGCAT 


CCAGAACCGT 


CGGTTCGAGT 


CGCAGATCCG 


CGAACAGCTG 


CAAACACTGC 


33900 




TGCCACCTTA 


TATGGTTCCA 


TCACGGATCG 


TCGTGTTGGA 


GCGGATGCCT 


CTCAACGCAA 


33960 




ACAGCAAAGT 


CGACCGTAAA 


GAATTGGCAA 


GGAAGGCGAG 


GACCCTACAA 


ACCATCAAGC 


34020 




CTTCTGCAAC 


GCGCGTGGCT 


CCTCGCAACG 


ATATTGAAGC 


CGTCTTGTGC 


GACGAGTTCC 


34080 




AGGCAGTTCT 


TGGTGTTACA 


GTCGGAGTCA 


TGGATAACTT 


TTTCGAGTTG 


GGCGGACACT 


34140 




CCCTGATGGC 


TACGAAACTG 


GCCGCCCGTC 


TCAGTCGCCG 


CCTCGACACC 


CGCGTCTCTG 


34200 




TGAAGGATAT 


CTTCAACCAA 


CCAATCCTTC 


AAGATCTCGC 


GGACGTGGTC 


CAGACTGGCT 


34260 




CCGCTCCTCA 


TGAAGCTATT 


CCCTCCACGC 


CCTACTCTGG 


TCCCGTGGAG 


CAATCCTTCT 


34320 




CTCAGGGCCG 


TCTATGGTTC 


TTGGATCAGC 


TGAATCTCAA 


TGCATCGTGG 


TACCACATGC 


34380 




CATTAGCGAG 


TCGCTTGCGA 


GGCCCGCTTC 


GGATCGAGGC 


GCTGCAGTCA 


GCCCTGGCTA 


34440 
CGATTGAGGC


GCGGCACGAG 


TCCCTGCGCA 




TTCAGATTGT 


ACGCGCTGCG 


CGCAACAAGC 




AGGATGCGTA 


TCTCGCAGCA 


TTGAAGCAAG 




AGCCAGGCTG 


GCGAGTAGCA 


CTGTTGCGCT 




TCATGCACCA 


CATCATATCT 


GACGGATGGT 




AGCTCTACTC 


GAATGCCTCA 


TCGCAGCCCG 




CCATCTGGCA 


GAAGCAGGAl* 


AGTCAGATCG 




AGAGACAACT 


GGTCAACAGC 


AAGCCGGCTG 




CGTTATCTGG 


CGATGCTGAT 


GTCATACCGA 




TCCGCTCGTT 


TTGTCGCGCT 


CGGCATGTCA 




GGGCTGCTCA 


CTACCGCCTA 


ACTGGGGCCG 


ACAGAAATCG 


ACCTGAGCTT 


GAAGGCCTCA 




GAATTCCTGT 


TAAGAGCGAG 


GACACATTTG 




CGACCGAGGC 


CCAGGACAAC 


CAAGATGTCC 




CTAGCTCGCG 


AGATACCTCG 


CGAAATCCAC 




AGCACGACCT 


TGGTAACATT 


CGTCTCGAAG 




CGTCCACACG 


CTTTGACGCG 


GAAATGCACC 




ACGTCGTCTT 


TTCGAAGGAT 


CTGTTCGAAT 




TCCAGGAGAC 


CCTGAGGCGT 


GGCCTAGCCA 




TTACCGATGG 


ATTGCCCAGT 


CTTCGAAGCC 




CCCGAGATGC 


CTCCGTGATC 


GACGTTTTCA 




TCGCCGTTAT 


CGATGCTTCT 


TCACAGCTCA 




AGCTCGCCAC 


GTGGCTACGC 


CGACAAGTCA 




CTCCACGGTC 


CTGTGAGACA 


ATCATCGCTT 


ATCTGCCACT 


TGACGTCAAC 


GCACCCGCTG 




CAGGAAACAG 


GCTTATTTTA 


CTTGGATCAG 




GCGTTCGATT 


CACCCGGATC 


AGCGACGCCC 




AACTTTCCAC 


ACGGCCGACT 


GCACAAAGCC 




CTGGCGTCCC 


GAAGGGTGTC 


ATGGTAGAGC 




GCAACGTGGT 


CGCAAAGCAA 


CCGGCAGCAG 




TCGACGCCTC 


TTCCTGGGAG 


ATAXACGCTC 




TTGATTACTA 


CACCACGATC 


GATATCAAAG 




TCCGCGGAGC 


AATGCTTCCA 


CCAGCACTTC 
CCACATTCGA


GGAGCAAGAl 


GGTGTTCCCG 


34500 


AGCTGAGGAT 


CATCGACGTG 


TCGGGCACCG 


34560 


AGCAAGACGC 


CGCATTCGAT 


CTGACTGCTG 


34620 


TGGGACCGGA 


TGATCATGTC 


CTGTCTATCG 


34680 


CGGTTGATAT 


CCTGCGACAA 


GAACTCGGGC 


34740 


CTCCTCTTCC 


GATTCAATAC 


CGAGATTTCG 


34800 


CTGAGCACCA 


AAAGCAGCTG 


AACTACTGGA 


34860 


AGCTCCTGGC 


GGACTTCACT 


CGTCCGAAGG 


34920 


TAGAGATTGA 


TGACCAGGTA 


TATCAGAACC 


34980 


CCAGCTTTGT 


TGCACTCTTA 


GCAGCTTTCC 


35040 


AAGATGCAAC 


TATCGGCTCT 


CCAATCGCCA 


35100 


TTGGATGCTT 


TGTTAACACC 


CAGTGTCTCC 


35160 


ACACGTTGGT 


TAAACAGGCA 


CGAGAAACGG 


35220 


CGTTCGAGAG 


GATCGTTTCT 


TCCATGGTTG 


35280 


TCGTTCAGGT 


CATGTTTGCT 


GTGCACTCTC 


35340 


GTGTTGAGGG 


GAAGCCCGTT 


TCGATGGCAG 


35400 


TATTTGAGGA 


CCAAGGGATG 


CTCGGCGGCA 


35460 


CCGAGACGAT 


CCGCAGTGTT 


GTGGCCGTGT 


35520 


ATCCTCACGC 


AAATCTCGCA 


ACACTTCCTC 


35580 


TGTGTCTTCA 


AGTCAATCAG 


CCTGACTACC 


35640 


GAGAGCAGGT 


AGCATCGATA 


CCCAAGTCTA 


35700 


CCTACACCGA 


GCTCGACGAG 


AGATCTAGCC 


35760 


CAGTCCCTGA 


GGAGCTGGTC 


GGCGTCCTCG 


35820 


TCCTCGGCAT 


CATCAAAGCG 


AATCTCGCCT 


35880 


GTCGGATCGA 


GACAATCCTG 


TCATCTCTAC 


35940 


ATACGCAGGC 


GGTCAAGCTT 


CACGCAAACA 


36000 


TCGTCGAGAG 


CGGCAGTCCC 


CCTACCGAAG 


36060 


TTGCCTATGT 


CATGTTCACA 


TCAGGCTCAA 


36120 


ACCGGGGTAT 


CACACGTCTC 


GTGAAAAACA 


36180 


CTGCTATCGC 


TCATCTTTCG 


AACATTGCTT 


36240 


CTCTCCTTAA 


CGGCGGTACA 


GTCGTCTGCA 


36300 


CCCTCGAGGC 


GGTATTCAAA 


CAGCACCACA 


36360 


TCAAACAGTG 


TCTGGTCTCT 


GCCCCTACTA 


36420 
TGATCAGCTC


TCTGGAGATA


PTTTTPGCCG 


CCGGCGATCG 


GTTGAGPAGP 


PAAGATGPPA 

VvAAU A X Ov^V^ A 


36480


TCCTGGCGCG 


ACGTGCCGTT 


GGTTCGGGCG 


TTTAPAAPGP 

X X X AXht AAVvOVif 


TT APGGPPP T 

X X AUrwV3v«^^ X 


APTG AG A AP A 

A^ X OAVJAA^A 




CGGTCCTGAG 


TACGATACAC 


AACATPGGPG 


AGAATGAGGP 

AV3AA.X OAOwVir 


A X X X X wUaA X 


VavjCo X X k^WL. A 


36600




TG TP AG T AAP 


TPPGGTGPPT 


TTGTP A TGG A 


TP A A A ATP AP 
1 VfAAAA X \« Alj 




36660




GATPGGAGAG 


PTTGTTGTGA 


PPrZPAPATPP 
Wl^OU AU A X Vjo 


l^L. X X u^I^^LjV^ 


ukjA X Av^ AL* Ao 


36720




T AGGGTPP A T 

X nwOw X X 


PG AT TP ATPT 


aX aX X A^V^^ X 


X \j AU VjVtVj AA<^ 


/^/"•/""/"•rpz-i A A ^ 

C-tjeGTCAGAG 


36780


PTT APPriPAP 


PGGPPAPPPT 


PTP & PPP & PP 


pp/^r"T a 


X AAA X X 


GAGx X CTTCG 


36840


rip A A T A 


TP AGP AG A TP 


'h.'L.C 'h.'VfT'C'VC 
i\ir%\3i\ X X V7 


oX^aX L.Ln-.AX 


y-^/-^ A r^or' a A 
A\jrV-, v-. ACjC a 


GAGGTGGAGC


36900 


A P/^P T P T P <^P 






AX XCCMjC TCjT 


TATCACTCAG


CTCACGGATG 


36960 


A A p A rsrs A^ipp 


GG A a P TGG T P 


^ «ri »p «s ^ «-p rp 


t-AX HjAAVjCjij 


vjAA X CjV^C^ AAC 


GGCACCAACG


37020 


PTi^TP A APPP 

V7 X o X ^ AA^OL? 


1 \3 X AOV^OA X 




X U Lj LiCiC A 


TGAGCAACAT 


GCTCTGCTGA 


37080


TP^AHA APA A 


GATPPGTP AP 


A APPT AP AGP 




V^AwTTACATG 


ATCCCCTCGC 


37140


PPATP A TPP A 




ViT J. 1 V, A 


Ax VsUL. AACwV? 


m TV A A »Tim/^ A ^ 


CGCAATGAGC 


37200 


TGGCTGTTPG 


AGPPP AGGP A 


APGPP A AGGA 


PP APTTPA/^T 


/^T*/^ A A /^r^rp A 
V3 X C AAV^C 1 AL* 


GTGGCCCCTC 


37260


GP AACG AT AT 


PGAAAPPATP 


ATPTGT A AGG 


A ATTPPP AP A 

AAX Xv»\9VvAWA 


X AX X \^ AVtVi* 


GX XCGAGXQ^G 


"3 T "3 O rt 


GAATCACAGA


PAAPTTPTTP 

\^njn\^ X X X X W 


G APPTGGGTG 


GAPAPTP APT 

V3A^ AW X V^AN^ X 


X A X AIjC w AC'V^ 


AAGCTAGCCG


37380 


CCCGCCTTAG 


CCGTCGACTA 


GATACTCGCG 


TGTPTGTTAG 

XV7X1k»XV3X X tWS 


GGAPGTPTTT 
V90 AV^W X X X X 


p A P A r» T^^<^ 




TGGTAGGCCA 


ATTGGCGGCT 


TCTATCCAGC 


AAGGP TPG AP 


PPPTPATGAA 
wVm»V« X ^ A X V2AA 


GPTATTPPPP 
I^L'XAX X^C'GG 


37500


CGTTATCACA 


CTCGGGACCT 


GTGCAACAGT


PPTTTGPTPA 

W XXX \JVi» X ^ A 


AGGPPGTPTT 

AOVS^v^O X X X 


TPPTTPPTPP 


37560


ACCGTTTCAA 


TCTCAACGCT 


GCCTGGTACA 


TCATGPPATT 

X W A X V3\m>\^ A X X 


PGGPGTTPGT 


U X X L^G^oGAL. 


37620


CTCTCCGAGT 


CGATGCACTT 


C AG AC TCP AT 


TG AGGGP TP T 

X wAOwVJ W X w X 


PGAAGA AP^IG 
WOAAwAAx^WW 


P AP^ a^'PT^/^ 


37680


TACGCACCAC 


GTTCGAAGAA


PAGGATGGPG 


TTGGTATGP A 
X X VjW X ax UIm a 


aAXCoX XvJAW 


TCrGCCCCGAA 


"3 "7 T il rt 

37740 


TGAGGGACAT


CTGCGTCGTA 


GAPATPTPTG 


GPGPPAATf^A 
OV^w^V^AA X OA 


PP A "Pr^TTr^T^r* 


AAGL> X GAAGG 


"3 "7 O A A 

37800 


AGGAGCAGPA 


AGPTPPTT'PP 


A ATPTPTPT A 
AAl X\»>1 A 




nfim/^O A r^^f^rw* a 

X TVtVs AGGGT A 


GCACTCTTCA 


37860 


AGGCTGGAPA 


G A APP APPAP 


A 1 1 V« X ^ X A 


TV^IjTCATVjCA 


TCACATAATT 


TCAGATGGCT


37920 


GGTPAGTTGA 


P ATPTTPPAP 


p & n /^/^nr>'P/~ 


CCCAATTCTA 


CTCGGTAGCT 


GTACGAGGGC 


37980 


ATGACCCCPT 


TTPPP AGGTP 


A/\/\V_k^OV^ i 


A X 1 Av^ xT A 


CCGCGATTT T 


GCTGTCTGGC 


38040 


AGAGACAAGA 


TAAGCAAGTT 


GCCGTTCACG 


AAAGCCAACT 


TCAGTACT3G 


ATAGAGCAGC 


38100


TCGCGGATAG 


CACGCCAGCC 


GAGATCCTAT 


CTGATTTTAA 


CCGACCGGAG 


GTCTTGTCCG 


38160 


GCGAAGCTGG 


TACAGTTCCC 


ATCGTGATCG 


AGGACGAGGT 


TTATGAGAAG 


C7CTCCCTCT 


38220 


TCTGCCGCAA 


TCATCAGGTC 


ACCAGCTTCG 


TCGTCCTTCT 


GGCTGCTTTC 


CGCGTCGCAC 


38280 


ATTATCGCCT 


AACTGGGGCA 


GAGGATGCGA 


CTATCGGTAC 


ACCAATTGCG 


AACCGCAACC 


38340 


GCCCCGAACT 


TGAGGACTTG 


ATCGGTTTCT 


TTGTCAATAC 


ACAATGCATG 


AGAATCGCGC 


38400 


TCGAAGAACA 


CGATAATTTC 


CTATCAGTAG 


TGCGAAGAGT 


TCGCTCAACA 


GCGGCAAGCG 


38460 
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CCTTCGAAAA 


CCAGGATGTG 


CCATTCGAGC 


GCCTTGTATC 


TGCACTTCTG 


CCCGGCTCTA 


38520 


GAGATGCCTC 


CCGGAATCCC 


CTCGTTCAAC 


TCATGTTTGT 


CGTCCACTCC 


CAGCGAAATC 


38580 


TCGGTAAACT 


GCAACTGGAG 


GGCTTGGAAG 


GCGAACCAAC 


CCCGTACACC 


GCGACGACCC 


38640 


GCTTCGATGT 


TGAGTTCCAC 


CTCTTCGAAC 


AAGACAAAGG 


CCTCGCCGGA 


AATGTTGTCT 


38700 


TCGCAGCAGA 


CTTGTTCGAG 


GCTGCCACTA 


TCCGCAGCGT 


TGTTGAAGTC 


TTCCACGAGA 


38760


TCCTCCGTCG 


TGGTCTCGAC 


CAGCCAGATA 


TCGCAATTTC 


CACCATGCCA 


CTTGTCGATG 


38820 


GCCTGGCGGC 


GCTCAACAGC 


CGTAACTTAC 


CCGCAGTTGA 


AGACATCGAA 


CCTGACTTCG 


38880


CCACCGAGGC 


CTCGGTGGTT 


GATGTCTTCC 


AGACACAAGT 


GGTCGCTAAC 


CCAGATGCCC 


38940 


TGGCTGTGAC 


CGACACATCC 


ACAAAGCTTA 


CATATGCGGA 


GCTGGATCAA 


CAATCCGATC 


39000 


ATGTCGCGGC 


TTGGCTGTCC 


AAACAGAAGC 


TACCAGCAGA 


GAGCATCGTC 


GTTGTTCTTG 


39060 


CGCCACGATC 


CTCTGAGACT 


ATCGTAGCA? 


GCATTGGCAT 


CCTCAAAGCG 


AACCTCGCAT 


39120 


ATCTCCCCAT 


GGATTCCAAC 


GTCCCCGAAG 


CCCGTCGCCA 


AGCAATTCTT 


TCGGAGATTC 


39180 


CAGGGGAGAA 


GTTCGTTTTG 


CTIGGAGCAG 


GAGTGCCTAT 


TCCTGACAAC 


AAGACAGCTG 


39240 


ATGTCAGGAT 


GGTCTTCATC 


AGCGATATCG 


TCGCCAGCAA 


GACAGACAAG 


TCCTACTCAC 


39300 


CCGGCACTCG 


GCCATCTGCA 


TCAAGCCTTG 


CCTATGTTAT 


CTTCACATCA 


GGCTCGACAG 


39360 


GTCGGCCAAA 


GGGTGTCATG 


GTCGAGCATC 


GGGGTGTTAT 


TTCTTTGGTG 


AAGCAGAACG 


39420 


CTTCAAGAAT 


ACCACAAAGT 


CTGCGGATGG 


CACATGTTTC 


CAATCTCGCA 


TTCGATGCTT 


39480 


CCGTGrGGGA 


GATATTCACC 


ACGCTGCTCA 


ATGGAGGAAC 


GCTTTTCTGT 


ATCAGCTACT 


39540 


TTACTGTCTT 


GGACAGCAAA 


GCACTTTCTG 


CCGCTTTCTC 


CGATCATCGC 


ATTAACATCA 


39600 


CCCTGCTCCC 


ACCGGCCTTG 


CTCAAGCAAT 


GTCTTGCAGA 


CGCGCCATCT 


GTCCTGAGCT 


39660 


CCCTCGAGTC 


TCTGTACATT 


GGAGGCGACC 


GCCTTGATGG 


AGCTGATGCA 


ACCAAGGTGA 


39720 


AGGACCTCGT 


CAAAGGCAAG 


GCCTACAATG 


CCTACGGTCC 


CACCGAGAAT 


TCCGTCATGA 


39780 


GCACGATCTA 


TACCATCGAA 


CACGAGACTT 


TTGCGAATGG 


CGTTCCCATC 


GGCACATCTT 


39840 


TAGGCCCCAA 


GTCCAAGGCC 


TACATTATGG 


ACCAGGATCA 


GCAGCTCGTA 


CCAGCAGGCG 


39900 


TGATGGGAGA 


GCTTGTCGTT 


GCT3GCGATG 


GTCTCGCACG 


AGGGTATACC 


GATCCATCAC 


39960 


TGAACACGGG 


CCGGTTCATC 


CACATCACGA 


TCGATGGCAA 


ACAAGTTCAG 


GCATACCGGA 


40020 


CCGGCGATCG 


AGTCAGATAC 


CGACCTAGGG 


ACTACCAAAT 


CGAGTTCTTT 


GGCCGTTTAG 


40080 


ATCAGCAGAT 


CAAGATTCGC 


GGTCATCGCA 


TCGAGCCAGC 


TGAAGTGGAG 


CAGGCTCTTC 


40140 


TCAGCGACTC 


ATCGATCAAC 


GATGCCGTTG 


TTGTGTCGGC 


ACAAAACAAG 


GAGGGACTCG 


40200 


AAATGGTTGG 


TTACATCACG 


ACCCAGGCTG 


CACAATCCGT 


CGACAAGGAG 


GAAGCCAGCA 


40260 


ACAAGGTGCA 


GGAGTGGGAG 


GCTCATTTCG 


ACTCAACTGC 


ATATGCCAAC 


ATCGGGGGTA 


40320 


TTGATCGCGA 


TGCCCTCGGA 


CAGGACTTCT 


TATCCTGGAC 


ATCTATGTAC 


GACGGCTCAT 


40380 


TGATTCCCCG 


TGAAGAGATG 


CAGGAATGGC 


TCAACGACAC 


TATGCGCTCA 


CTCCTCGACA 


40440 
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■K ^^^^ m ^ ^^^^ ^ ^^^^ 

ACCAACCACC 


C GG AAAAt3 T G 


L- J. wo A\j A 1 l^V3 


u AAC* 1 V3V3 -1 AV.* 


W WW X A X V9 w -1 w 


CTGTTCAATC 


40500 




TCGGCAAGGT 


TGAGGGACTA 


C AG AGC 1 A i G 


CCVjO 1 w 1 1 \3A 


VTWWW -1 W wWOW 


TCCGTCACCG 


40560 




CCTGGGTTAA 


CAAGGCAATC 


(j AAAQ. Ill WV^ 


C AAuCC 1 V30C 


A<^r? A Afipfipp 

avjVjaAoWVjW w 


PGAGTPCACG 

wOAO X W W AWO 


40620 




TTGGAACCGC 


CGAGGATATC 


AGCTCCATTG 


ATGGAcTTCo 


TTOOna TOTO 
1 1 wCoAl C i C 


f^TTflTfSATP A 
0 X X OX OAX WA 


40680 




ACTCAGTCGC 


CCAATACTTC 


CCAAGTCGAG 


AATAU.C i CoC 


T»o a/^oT/; APrs 


OWv^AAW 1 X \3t\ 


40740


TTCGACTGCC 


CGGCGTTAAG 


CGTATTTTCT 


ICvjVjUjACAI 


r: A(^ A AOfST AT" 
V3 AoAaWO X a z 


riPT APP A AT A 

OW X A^WAAX A 


40800




AGGACTTCTT 


GGTGGCACGA 


GCAGTuuATA 


CCC X Atjvjij 1 C 


P A A Trip A TPr^ 
W AA X I3WA 1 WO 


A APPPPATnn 

AAOOWW A X 00 


40860




TTCGACAACA 


AGTGGCCAAG 


C T T G AA' J AT Ca 


AC \j A\j V-3 AAVj A 


VJ 1 XoC i Xo- 


f^APPPTC^PPT 
OAWWW X OWW X 


40920 




TCTTCACCAG 


C C T G AGC G AC 


CAGTTC'wCTG 


ACVjAAA 1 C AA 


<^P ATi^TPf^ A A 
OW A i 0 1 WVjAA 


ATTPTHPPAA 

A X X W X OWW AA 


40980 




AGAGGATGGC 


C GC G ACC AAC 


LiAAL. rC^AljC 1 


CI lACAuAiA 


X U W X OW X 0 X W 


Jt\ X X w A X 0 X 00 


41040 




GAGGCCACCA 


GATGCCGAAT 


GGGGAGGATCj 


a aT a ^nr* a 
A0V3A 1 AAvtC A 


A X VjrOoW X 0 X W 


A A^'Vi A T A T P A 

AAOOA X a X W A 


41100 




ATCCGAAGGC 


CTGGGTGGAC 


1 i luv^loOCA 


CU AUU A 1 A 


PPf^TP AfifSPT 
WW\3 X WAOwW X 


OTpTTf^PAGP 
W X W * X OW AOW 


41160 




TCCTTCAGGA 


CCGCCAACuT 




1 iul IV9CCV3I 


P A<^TA APATP 
W AV3 X AAWA X W 


PP AT AP Af?P A 


41220 




AGACCATCAT 


GGAGCoCv^AT 


l.. 1 V9 1 ^ 1 AO 1 




X 0 AWUAOw AW 


GGCACTTPAG 


41280 




CGGTAGACGG 


AACGGCCTGG 


ATATCoCGTA 


CurC>AA 1 C aCu 


ovsCoaAooaa 


TfZPPPT(^P*^P 
X OWWvw X OW - w 


41340 




TCTCAGTGGC 


CGACCTGATT 


G AG AT TGGT A 


AVjrfjV3V9 A 1 C OVJ 


W X X Wo AAO X X 


f5 Af^fiPP Af3PT 
OAOOWWAOW X 






GGGC T CG AC A 


ACACTCCCAG 


f^r^ r^f^f\t~'r^r^ tv f~« 


TOO aTTiOT'f^'P 

1 CvsA 1 oC 1 0 1 


TTTOP aPP<^ A 
X X XWWAWWOa 


TTP^XAAPPAP 
X xwoaawwaw 


41460 




CAAGACACTC 


AGGTCATGTC 


TV rn/^ tpn*^ TV /~r^rp 
ATol KJAtjisl 


TooorsaOTTsa 
1 CCCurAC 1 UA 


A P A P A Al^fSrtP 
A W AW AAOOO W 


P<^fiTPTTPCA 

woo X W X X WOA 


41520 


GCAGTCTCAC 


GAATCGCCCG 




rpmoaoanooi^ 
1 1 CAvjAuCCu 


PPP APT/Z<^A<^ 
WCo AW 1 00 AVar 


(ZC A A A^^^^ TPP 

0 WAAAOO X WW 


41580




GCG AGCGGC T 


GCAATCACTG 


CTTCCACCGT 


a 0 a T/n a t too 
ACAIoaI ICC 


i^'POTOl'i/i a TP 
Ol C IWOOAIC 


A Pf5 TT^SPTP*^ 
AWO X X OW X WO 


41640 




AT C AG AT GC C 


T C T C ACGTCC 


AAv^ uVaL. Aaoo 


Tr^*^ a TO/^o a a 
1 Ljo A 1 CVjC AA 


na arspTT^^PT 

0 AaoW 1 i> OW X 


Pf^APAAfiPPP 
WO AW AAOW WW 


41700




GGGTCATCCC 


TV TV TV TV^TI/^/^/^ 


VjC AAoCAi^Vs 1 


TOO aOTT Tm* 
1 ooAC 1 1 lo 1 


00 W OW W AW OW 


APf^rtA AATPf^ 
AWOOAAA X WO 


41760






C i vjCVjAAoAA 


i i 1 AV«.CvjA-l 




WAAOO X X oow 


ATPAPAf^APA 


41820 




AC T T (_ T T A 


G T T C: C:: VjC Gb C 




Tooooaof^a a 
1 LtVjCCACoAA 


a p a r;p r;p A 

aW X OAOWOWa 


PPTPT A Ar^TP 
WO X W X AAO X W 


41880 








AC 1 vj 1 tjAALjC 


ALjL? 1 C i 1 1 17 A 


PP APPP AnT A 
WW aOW W AO X A 


PTTf^TnATP 
W X X OW X OA X W 


41940 




1 i CjL- i oC i 1 C 


1 A 1 1 C i 1 C AA 


uoC i 1 CI C 


OTO a 0 a finTO 

0 i w AW AOw 1 W 


T aTPPPfSTPT 
X AX WWWOXW X 


TTAPPPTAPG 

X X AWWW X AWO 


42000 






LrtjrAVaCAts i v,t* 


111 V9C>CC AO%9 




(^TTPPTPf^AP 
0 X X WW X WOAW 


P A<^TTP AAPA 

W AO X X WAAWA 


42060 




TCGATGCCTT 


GTGGTACCTT 


ATTCCATTTG 


CACTCCGCAT 


GCGCGGGCCG 


CTGCAAGTTG 


42120 




ACGCCCTCGC 


TGCTGCCCTG 


GTGGCACTTG 


AAGAGCGTCA 


TGAATCTCTG 


CGCACAACGT 


42180 




TTGAGGAACG 


AGACGGAGTC 


GGCATCCAAG 


TGGTGCAACC 


CCTCCGCACG 


ACCAAGGATA 


42240 




TCCGGATCAT 


CGACGTGTCA 


GGCATGC6AG 


ACGACGACGC 


CTACCTCGAG 


CCATTGCAGA 


42300 


AAGAACAGCA 


GACTCCTTTC 


GACCTTGCTT 


CAGAGCCTGG 


CTGGAGGGTA 


GCACTGCTGA 


42360 




AGCTTGGAAA 


GGATGACCAC 


ATCCTCTCTA 


TTGTCATGCA 


CCACATCATC 


TCTGACGGGT 


42420 




GGTCTACTGA 


AGTCTTGCAA 


AGGGAACTCG 


GTCAATTCTA 


CTTGGCAGCG 


AAATCCGGGA 


42480

AAGCCCCCTT ATCGCAGGTT GCCCCGCTTC CTATTCAGTA TCGCGATTTT GCTGTTTGGC 42540

AGAGACAAGA GGAACAGGTC GCTGAGAGTC AAAGGCAGCT CGACTACTGG AAGAAGCAGC 42600

TTGCGGACAG TAGCCCGGCT GAGCTCTTGG CTGACTACAC CAGGCCGAAC GTACTGTCTG 42660

GAGAGGCAGG CAGCGTGTCT TTCGTGATCA ACGATTCGGT TTACAAGAGC CTCGTCTCCT 42720 

TCTGCCGGTC TCGCCAAGTA ACCACCTTTA CGACTTTACT GGCAGCGTTT CGCGCCGCTC 42780 

ACTATCGAAT GACCGGGTCA GACGACGCAA CTATTGGCAC GCCAATTGCC AATCGCAACA 42840 

GGCCTGAGCT TGAAAACTTG ATCGGCTGCT TCGTCAATAC CCAGTGCATG CGTATCACTA 42900 

TCGGCGACGA TGAGACGTTT GAATCACTGG TACAACAGGT ACGGTCTACC ACCGCGACAG 42960

CCTTCGAGAA TCAAGACGTT CCGTTTGAAC GAATCGTTTC CACCCTCAGT GCCGGGTCCA 43020 

GGGATACGTC CCGAAACCCC CTAGTACAGC TTCTCTTTGC GGTTCATTCT CAACAAGGCC 43080 

TGGGCAGGAT CCAGCTCGAC GGTGTCGTCG ATGAGCCGGT TCTGTCGACC GTTTCGACTC 43140 

GGTTCGATCT CGAATTCCAC GCCTTCCAAG AGGCCGACCG GCTCAATGGA AGTGTCATGT 43200 

TTGCCACG3A CCTGTTCCAG CCCGAGACCA TCCAAGGTTT CGTTGCGGTT GTCGAAGAGG 43260 

TTCTACAGCG TGGCCTGGAG CAGCCGCAGA GTCCCATCGC AACCATGCCG CTGGCCGAAG 43320 

GCATCGCTCA GCTCCGAGAT GCCGGCGCGC TGCAGATGCC AAAGTCTGAT TACCCTCGCA 43380 

ACGCGTCCCT CGTCGATGTC TTCCAGCAGC AGGCTATGGC CAGCCCGTCA ACTGTCGCCG 43440

TCACTGACTC GACCTCCAAG CTGACGTATG CCGAGCTGGA TCGACTCTCC GATCAAGCTG 43500 

CTTCCTATCT GCGTCGGCAG CAACTCCCGG CGGAGACAAT GGTGGCCGTT CTCGCACCGC 43560 

GCTCTTGTGA GACCATCATC GCGTTCCTAG CTATTCTCAA AGCAAATCTT GCCTACATGC 43620 

CTCTCGACGT CAACACGCCA TCTGCTCGCA TGGAAGCCAT CATATCGTCC GTCCCAGGGC 43680

GTAGGCTCAT CTTGGTTGGC TCGGGCGTCC GCCATGCTGA TATCAACGTA CCGAACGCAA 43740 

AGACGATGCT GATCAGCGAC ACGGTTACCG GGACTGATGC TATTGGCACT CCCGAACCTC 43800 

TGGTTGTCCG ACCCAGTGCT ACAAGTCTCG CATATGTCAT CTTCACTTCA GGGTCAACGG 43860

GCAAGCCAAA GGGTGTCATG GTGGAGCACC GTGCTATCAT GCGCCTTGTG AAGGACAGTA 43920 

AC3TCGTGAC TCACATGCCA CCAGCGACAC GGATGGCCCA CGTCACGAAT ATCGCATTCG 43980

ACGTTTCACT GTTCGAGATG TGCGCAACGC TCCTCAACGG CGGAACTCTA GTCTGCATTG 44040

ACTACCTGAC CCTTCTTGAC AGCACCATGC TCCGGGAGAC GTTTGAGCGT GAGCAGGTTC 44100

GCGCAGCCAT CTTCCCGCCA GCACTCCTGC GACAGTGCTT GGTCAACATG CCCGATGCGA 44160 

TCGGCATGTT AGAGGCTGTT TACGTTGCCG GTGATCGCTT CCACTCCCGC GACGCCCGCG 44220

CAACCCAGGC ACTGGCCGGG CCTCGTGTGT ACAACGCGTA TGGCCCAACT GAGAACGCAA 44280 

TCCTTAGCAC GATATATAAC ATCGATAAGC ACGATCCGTA TGTGAACGGT GTTCCTATCG 44340 

GTAGCGCTGT CAGCAATTCA GGGGCCTATG TCATGGATCG GAACCAGCAG CTTCTCCCTC 44400 

CCGGTGTGAT GGGAGAGCTG GTTGTTACAG GAGAGGGTGT AGCTCGCGGC TATACCGACG 44460

CAAGTCTCGA TACGGACCGC TTCGTCACCG TCACGATCGA TGGCCAGCGC CAGAGGGCGT 44520

ACCGCACGGG TGACCGGGTG CGATATCGAC CAAAGGGATT CCAGATAGAG TTCTTCGGCC 44580

GCCTGGACCA GCAGGCCAAG ATTCGCGGCC ACCGTGTTGA ACTGGGCGAG GTCGAACATG 44640

CTCTGCTCAG CGAGAATTCA GTCACGGATG CGGCTGTCGT ACTCCGCACC ATGGAAGAGG 44700

AGGACCCGCA ACTGGTTGCC TTTGTGACTA CTGATCACGA ATATCGCTCG GGTTCGAGCA 44760

ACGAAGAGGA GGATCCGTAC GCCACACAGG CAGCAGGCGA TATGCGCAAG CGACTCCGGT 44820

CGCTTCTGCC ATACTACATG GTCCCGTCCC GGGTCACAAT ACTCAGGCAA ATGCCTCTCA 44880

ACGCCAACGG CAAGGTGGAC CGAAAAGACC TCGCTCGGCG GGCCCAGATG ACTCCGACAG 44940

CAAGCAGCTC GGGCCCCGTG CATGTGGCTC CTCGCAACGA GACTGAGGCA GCAATTTGCG 45000

ACGAGTTCGA GACTATACTC GGAGTCAAGG TGGGAATCAC AGACAACTTC TTCGAACTAG 45060

GCGGGCACTC ACTCCTGGCC ACCAAACTCG CTGCTCGGCT CAGCCGCCGG ATGGGCCTTC 45120

GCATATCCGT CAAGGATCTG TTTGACGATC CTGTTCCTGT TTCTCTCGCC GGCAAGCTGG 45180

AACAACAGCA GGGGTTCTCG GGAGAAGATG AAAGCTCGAC AGTTGGTATT GTCCCCTTCC 45240

AACTCCTCCC CGCGGAAATG TCGAGAGAGA TCATCCAGCG CGATGTTGTA CCTCAGATTG 45300

AGAACGGTCA CAGCACACCC CTGGACATGT ATCCAGCCAC GCAGACGCAG ATCTTCTTCC 45360

TGCACGACAA AGCGACGGGC CACCCAGCCA CGCCGCCACT GTTCTCCTTG GACTTCCCCG 45420

AGACCGCCGA CTGCCGTCGT CTGGCAAGCG CCTGCGCCGC TCTCGTCCAG CACTTTGACA 45480

TATTCAGAAC CGTGTTCGTG TCAAGAGGCG GCCGCTTCTA CCAAGTTGTT CTTGCTCATC 45540

TCGATGTACC TGTCGAGGTC ATCGAGACCG AGCAAGAGTT GGATGAGGTT GCTCTCGCGC 45600

TGCATGAAGC AGACAAGCAG CAGCCCCTAC GTCTGGGACG TGCGATGCTG CGGATCGCCA 45660

TCCTCAAGAG ACCGGGAGCC AAGATGCGAC TTGTTCTCCG AATGTCTCAT TCCCTGTACG 45720

ACGGCTTGAG TCTTGAACAC ATCGTCAACG CTCTACATGC CTTGTACAGT GATAAGCACC 45780

TTGCGCAAGC ACCCAAGTTT GGTCTCTACA TGCATCACAT GGCTAGCCGA CGTGCAGAGG 45840

GCTACAATTT CTGGCGATCT ATTCTTCAGG GCTCTTCAAT GACATCCCTG AAGCGCTCTG 45900

TCGGCGCCCT CGAGGCCATG ACGCCGTCTG CCGGTACATG GCAGACGTCA AAGTCCATCA 45960

GGATCCCTCC TGCGGCACTC AAGAACGGCA TTACGCAGGC GACCCTCTTC ACCGCCGCCG 46020

TCTCTCTCTT GCTCGCCAAG CATACCAAGT CGACAGACGT CGTCTTCGGC CGCGTCGTAT 46080

CTGGACGACA GGATCTCTCC ATAAACTGCC AAGACATCGT QGGACCTTGC ATCAACGAGG 46140

TGCCTGTGCG CGTTCGGATC GACGAGGGCG ACGACATGGG TGGTCTGCTG CGCGCCATTC 46200

AAGACCAGTA CACCAGCAGC TTCCGGCACG AGACCTTGGG CTTGCAAGAA GTGAAGGAGA 46260

ACT-3CACGGA CTGGACTGAT GCGACCAAGG AGTTCAGTTG CTGCATTGCC TTCCAGAACC 46320

TCAACCTGCA TCCTGAGGCC GAGATTGAAG GGCAGCAGAT TCGCCTGGAG GGTTTGCCAG 46380

CAAAGGATCA AGCACGCCAG GCCAATGGTC ATGCCCCAAA TGGCACGAAC GGCACGAATG 46440

GCACGAATGG CACGAATGGC GCGAACGGCA CGAATGGCAC GAATSGCACG AATGGTACCC 46500

ATGCCAACGG TATCAATGGT AGCAACGGTG TCAATGGCCG CGATAGCAAC GTGGTTTCAG 46560

CCGCTGGCGA TCAAGCTCCT GTTCACGATC TGGACATTGT TGGGATTCCG GAGCCCGACG 46620

GCAGCGTCAA GATTGGCATT GGTGCGAGCC GGCAGATCCT TGGAGAGAAG GTCGTGGGCA 46680

GCATGCTCAA TGAACTTTGC GAGACCATGC TCGCTTTGAG CAGAACATAG CAGCTTTTCC 46740

AGGGAGATTG GTTGGATGGA CAAGATTCTC TTCAATTATG GAGGTTGGCA TGAGGCAACA 46800

GGAGGACTAC TGACTTTTCA TGTTTTTTGG GGTTTTTTGG GGTTTTCTTT TTCCTTTCAT 46860

CTTTACTTGA TGCGCGATGT CTGCTTTCCT CTAGAATTC 46899
(2) INFORMATION FOR SEQ ID NO: 2: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 


(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Tolypocladium niveum

(B) STRAIN: ATCC 34921 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Ala Ile Gly Gln Asp Met Ala Tyr Asp Arg Leu Ala Asn Pro
1 5 10 15

Ser Arg Ala Ser Ser Ile Ser Ser Asn Arg Tyr Ser Glu Pro Val Glu
20 25 30

Gln Ser Phe Ala Gln Gly Arg Leu Trp Phe Leu His Gln Leu Lys Leu
35 40 45

Gly Ala Ser Trp Asp Ile Thr Pro Ala Ala Ile Arg Leu Arg Gly His
50 55 60

Leu Asp Ile Asp Ala Leu Asn Ala Ala Ser Arg Ala Leu Thr Gln Arg
65 70 75 80

His Glu Thr Leu Arg Thr Thr Phe Lys Glu Gln Asp Gly Val Gly Val
85 90 95

Gln Val Val His Ala Ser Gly Leu Glu Arg Gly Leu Arg Ile Val Asp
100 105 110

Ala Ser Ser Arg Asp Leu Ala Gln Leu Leu Ala Glu Glu Gln Thr Met
115 120 125

Lys Phe Asp Leu Glu Ser Glu Pro Ala Trp Arg Val Ala Leu Leu Lys
130 135 140

Val Ala Glu Asp His His Ile Leu Ser Ile Val Val His His Ile Ile
145 150 155 160

Ser Asp Ser Arg Ser Leu Asp Ile Ile Gln Gln Glu Leu Gly Glu Leu
165 170 175

Tyr Thr Ala Ala Ser Gln Gly Lys Ser Ile Ser Ala Cys Pro Leu Gly
180 185 190

Pro Ile Pro Ile Gln Tyr Arg Asp Leu Thr Thr Trp Gln Asn Gln Asp
195 200 205

Glu Gln Val Ala Glu Gln Glu Arg Gln Leu Gly Tyr Trp Ile Glu Gln
210 215 220

Leu Asp Asn Asn Thr Pro Ala Glu Leu Leu Thr Glu Leu Pro Arg Pro
225 230 235 240

Ala Ile Pro Ser Gly Glu Thr Gly Lys Ile Ser Phe Gln Ile Asp Gly
245 250 255

Ser Val His Lys Glu Leu Leu Ala Phe Cys Arg Ser Gln Gln Val Thr
260 265 270

Ala Tyr Ala Val Leu Leu Ala Ala Phe Arg Val Ala His Phe Arg Leu
275 280 285

"^^"^ ^^"^ Ala P^o Val Ala Asn Arg Asp 

290 295 300 

Arg Pro Glu Leu Glu Asn Met Val Ala Pro Leu Ala Thr Leu Gln Cys
305 310 315 320

Met Arg Val Val Leu Asp Glu Asp Asp Thr Phe Glu Ser Val Leu Arg
325 330 335

Gln Ile Met Ser Val Met Thr Glu Ala His Ala Asn Arg Asp Val Pro
340 345 350

Phe Glu Arg Ile Val Ser Ala Leu Leu Pro Gly Ser Thr Asp Thr Ser
355 360 365

Arg His Pro Leu Val Gln Leu Met Phe Ala Leu His Pro Ala Gln Asp
370 375 380

"^^P Gly l-eu Glu Ala Glu Thr Leu Gin Ser 

385 390 395 400 

Ala Ala Pro Thr Arg Phe Asp Met Glu Met His Leu Phe Glu Gly Asp
405 410 415

Asp Arg Phe Asp Ala Asn Val Leu Phe Ser Thr Gly Leu Phe Asp Ala
420 425 430

Glu Ala Ile Arg Ser Val Val Ser Ile Phe Arg Glu Val Leu Arg Arg
435 440 445

Gly Ile Ser Glu Pro Ala Val His Val Lys Thr Met Pro Leu Thr Asp
450 455 460

Gly Leu Ala Ala Ile Arg Asp Met Gly Leu Leu Asp Ile Gly Thr Thr
465 470 475 480

Asp Tyr Pro Arg Glu Ala Ser Val Val Asp Met Phe Gln Glu Gln Val
485 490 495

Ala Leu Asn Pro Ser Ala Thr Ala Val Ala Asp Ala Ser Ser Arg Leu
500 505 510

Ser Tyr Ser Glu Leu Asp His Lys Ser Asp Gln Leu Ala Ala Trp Leu
515 520 525

Arg Arg Arg Gln Leu Lys Pro Glu Thr Leu Ile Gly Val Leu Ser Pro
530 535 540

Pro Ser Cys Glu Thr Met Val Ser Phe Leu Gly Ile Leu Lys Ala His
545 550 555 560

Leu Ala Tyr Leu Pro Leu Asp Ile Asn Val Pro Leu Ala Arg Ile Glu
565 570 575

Ser Ile Leu Ser Ala Val Asp Gly His Lys Leu Val Leu Leu Gly Ser
580 585 590

Asn Val Pro Gln Pro Lys Val Asp Val Pro Asp Val Glu Leu Leu Arg
595 600 605

Ile Ser Asp Ala Leu Asn Gly Ser Gln Val Asn Gly Leu Ala Gly Lys
610 615 620

Gln Ala Thr Ala Lys Pro Ser Ala Thr Asp Leu Ala Tyr Val Ile Phe
625 630 635 640

Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Val Met Ile Glu His Arg
645 650 655

Gly Ile Val Arg Leu Val Lys Gly Thr Asn Ile Ile Ser Pro Ala Gln
660 665 670

Ala Ala Val Pro Thr Ala His Leu Ala Asn Ile Ala Phe Asp Leu Ser
675 680 685

Thr Trp Glu Ile Tyr Thr Pro Ile Leu Asn Gly Gly Thr Leu Val Cys
690 695 700

Ile Glu His Ser Val Thr Leu Asp Ser Lys Ala Leu Glu Ala Val Phe
705 710 715 720

Thr Lys Glu Gly Ile Arg Val Ala Phe Leu Ala Pro Ala Leu Ile Lys
725 730 735

Gln Cys Leu Ala Asp Arg Pro Ala Ile Phe Ala Gly Leu Asp Ser Leu
740 745 750

Tyr Ala Ile Gly Asp Arg Phe Asp Arg Arg Asp Ala Leu His Ala Lys
755 760 765

Ser Leu Val Lys His Gly Val Tyr Asn Ala Tyr Gly Pro Thr Glu Asn
770 775 780

Ser Val Val Ser Thr Ile Tyr Ser Val Ser Glu Ala Ser Pro Phe Val
785 790 795 800

Thr Gly Val Pro Val Gly Arg Ala Ile Ser Asn Ser Gly Ala Tyr Val
805 810 815

Met Asp Gln Asp Gln Gln Leu Val Ser Pro Gly Val Met Gly Glu Leu
820 825 830

Val Val Ser Gly Asp Gly Leu Ala Arg Gly Tyr Thr Asp Ser Ala Leu
835 840 845

Asp Lys Asn Arg Phe Val Val Val Gln Ile Asp Gly Glu Ser Ile Arg
850 855 860

Gly Tyr Arg Thr Gly Asp Arg Ala Arg Tyr Ser Leu Lys Gly Gly Gln
865 870 875 880

Ile Glu Phe Phe Gly Arg Met Asp Gln Gln Val Lys Ile Arg Gly His
885 890 895

Arg Ile Glu Pro Ala Glu Val Glu His Ala Leu Leu Asn Ser Asp Gln
900 905 910

Val Arg Asp Ala Ala Val Val Ile Arg Arg Gln Glu Glu Glu Glu Pro
915 920 925

Ala Met Ile Ala Phe Val Thr Thr Gln Gly Thr Leu Pro Asp His Leu
930 935 940

Val Asn Ile Asn Gly Asn Gly His Val Pro Asp Gly Asn Gly Ser Lys
945 950 955 960

Asn Asp Gln Phe Ala Val His Val Glu Ser Glu Leu Arg Arg Arg Leu
965 970 975

Gln Met Leu Leu Pro Ser Tyr Met Met Pro Ala Arg Ile Val Val Leu
980 985 990

Asp His Leu Pro Leu Asn Pro Asn Gly Lys Val Asp Arg Lys Ala Leu
995 1000 1005

Gly Gln Ser Ala Lys Thr Val Gln Lys Ser Lys Leu Val Ser Gln Arg
1010 1015 1020

Val Ala Pro Arg Asn Glu Ile Glu Ala Val Leu Cys Glu Glu Tyr Arg
1025 1030 1035 1040

Ser Val Leu Gly Val Glu Val Gly Ile Thr Asp Asn Phe Phe Asp Leu
1045 1050 1055

Gly Gly His Ser Leu Thr Ala Met Lys Leu Ala Ala Arg Ile Ser Gln
1060 1065 1070

Arg Leu Asp Ile Gln Ala Ser Val Ala Thr Val Phe Glu Gln Pro Met
1075 1080 1085

Leu Ala Asp Leu Ala Ala Thr Ile Gln Arg Gly Ser Thr Leu Tyr Ser
1090 1095 1100

YfL"^-^® "^y^ '^^^ Gly Pro Val Glu Gin Ser Phe Ala 

1^05 1110 1115 1120 

Gln Gly Arg Leu Trp Phe Leu Glu Gln Leu Asn Thr Gly Ala Ser Trp
1125 1130 1135

Tyr Asn Val Met Leu Thr Val Arg Leu Arg Gly His Leu Asp Val Asp
1140 1145 1150

Ala Leu Gly Thr Ala Leu Leu Ala Leu Glu Lys Arg His Glu Thr Leu
1155 1160 1165

Arg Thr Thr Phe Glu Glu Arg Asp Gly Val Gly Met Gln Val Val His
1170 1175 1180

ffor^®"" ^-9 Leu He Asp He Ser Glu Lys Ser 

1185 1190 1195 1200

Gly Thr Ala Ala His Glu Ala Leu Met Lys Glu Gln Ser Thr Arg Phe
1205 1210 1215

Asp Leu Thr Arg Glu Pro Gly Trp Arg Val Ala Leu Leu Lys Leu Ala
1220 1225 1230

Asp His His Ile Phe Ser Ile Val Met His His Ile Val Ser Asp Gly
1235 1240 1245

Trp Ser Leu Asp Leu Leu Arg His Glu Leu Gly Gln Leu Tyr Ser Ala
1250 1255 1260

Ala Leu Arg Gly Gln Asp Pro Leu Ser Arg Leu Glu Pro Leu Pro Ile
1265 1270 1275 1280

Gln Tyr Arg Asp Phe Ala Val Trp Gln Lys Gln Asp Ser Gln Gln Lys
1285 1290 1295

Ala Ala His Gln Arg Gln Leu Glu Tyr Trp Thr Lys Gln Leu Ala Asp
1300 1305 1310

Ser Thr Pro Ala Glu Leu Leu Thr Asp Phe Pro Arg Pro Ser Ile Leu
1315 1320 1325

Ser Gly Lys Ala Gly Lys Val Pro Val Ala Ile Glu Gly Ser Leu Tyr
1330 1335 1340

Asp Thr Leu Gln Val Phe Ser Arg Thr His Gln Val Thr Ser Phe Ala
1345 1350 1355 1360

Val Leu Leu Ala Ala Phe Arg Ala Ala His Phe Arg Leu Thr Gly Ser
1365 1370 1375

Asp Asn Ala Thr Ile Gly Val Pro Ser Ala Asn Arg Asn Arg Pro Glu
1380 1385 1390

Leu Glu Asn Val Ile Gly Phe Phe Val Asn Thr Gln Cys Ile Arg Ile
1395 1400 1405

Thr Ile Asp Glu Asn Asp Asn Phe Glu Ser Leu Val Arg Gln Val Arg
1410 1415 1420

Ser Thr Thr Thr Ala Ala Gln Asp Asn Gln Asp Val Pro Phe Glu Gln
1425 1430 1435 1440

Val Val Ser Ser Leu Met Pro Ser Ser Ser Arg Asp Ala Ser Arg Asn
1445 1450 1455

Pro Leu Val Gln Leu Met Phe Ala Leu His Gly Gln Gln Asp Leu Phe
1460 1465 1470

Lys Ile Gln Leu Glu Gly Thr Glu Glu Glu Val Ile Pro Thr Glu Glu
1475 1480 1485

Val Thr Arg Phe Asp Ile Glu Phe His Leu Tyr Gln Gly Ala Ser Lys
1490 1495 1500

Leu Ser Gly Asp Ile Ile Phe Ala Ala Asp Leu Phe Glu Ala Glu Thr
1505 1510 1515 1520

Ile Arg Gly Val Val Ser Val Phe Gln Glu Val Leu Arg Arg Gly Leu
1525 1530 1535

Gln Gln Pro Gln Thr Pro Ile Met Thr Met Pro Leu Thr Asp Gly Ile
1540 1545 1550

Pro Glu Leu Glu Arg Met Gly Leu Leu His Met Val Lys Thr Asp Tyr
1555 1560 1565

Pro Arg Asn Met Ser Val Val Asp Val Phe Gln Gln Gln Val Arg Leu
1570 1575 1580

Ser Ala Glu Ala Thr Ala Val Ile Asp Ser Ser Ser Arg Met Ser Tyr
1585 1590 1595 1600

Ala Glu Leu Asp Gln Arg Ser Asp Gln Val Ala Ala Trp Leu Arg Gln
1605 1610 1615

Arg Gln Leu Pro Ala Glu Thr Phe Val Ala Val Leu Ala Pro Arg Ser
1620 1625 1630

Cys Glu Ala Val Ile Ala Leu Phe Gly Ile Leu Lys Ala Gly His Ala
1635 1640 1645

Tyr Leu Pro Leu Asp Val Asn Val Pro Ala Ala Arg Leu Arg Ala Ile
1650 1655 1660

Leu Ala Glu Val Lys Gly Glu Lys Leu Val Leu Leu Gly Ala Gly Glu
1665 1670 1675 1680

Pro Ser Pro Glu Gly Gln Ser Pro Glu Val Ser Ile Val Arg Ile Ala
1685 1690 1695

Asp Ala Thr Ser Pro Ala Gly His Ala Ser Leu Arg Asp Gly Lys Ser
1700 1705 1710

Lys Pro Thr Ala Gly Ser Leu Ala Tyr Val Ile Phe Thr Ser Gly Ser
1715 1720 1725

Thr Gly Lys Pro Lys Gly Val Met Ile Glu His Arg Gly Val Leu Arg
1730 1735 1740

Leu Val Lys Gln Thr Asn Ile Leu Ser Ser Leu Pro Pro Ala Gln Thr
1745 1750 1755 1760

Phe Arg Met Ala His Met Ser Asn Leu Ala Phe Asp Ala Ser Ile Trp
1765 1770 1775

Glu Val Phe Thr Ala Leu Leu Asn Gly Gly Ser Leu Val Cys Ile Asp
1780 1785 1790

Arg Phe Thr Ile Leu Asp Ala Gln Ala Leu Glu Ala Leu Phe Leu Arg
1795 1800 1805

Glu His Ile Asn Ile Ala Leu Phe Pro Pro Ala Leu Leu Lys Gln Cys
1810 1815 1820

Leu Thr Asp Ala Ala Ala Thr Ile Lys Ser Leu Asp Leu Leu Tyr Val
1825 1830 1835 1840

Gly Gly Asp Arg Leu Asp Thr Ala Asp Ala Ala Leu Ala Lys Ala Leu
1845 1850 1855

Val Lys Ser Glu Val Tyr Asn Ala Tyr Gly Pro Thr Glu Asn Thr Val
1860 1865 1870

Met Ser Thr Leu Tyr Ser Ile Ala Asp Thr Glu Arg Phe Val Asn Gly
1875 1880 1885

Val Pro Ile Gly Arg Ala Val Ser Asn Ser Gly Val Tyr Val Met Asp
1890 1895 1900

Gln Asn Gln Gln Leu Val Pro Leu Gly Val Met Gly Glu Leu Val Val
1905 1910 1915 1920

Thr Gly Asp Gly Leu Ala Arg Gly Tyr Thr Asn Pro Ala Leu Asp Ser
1925 1930 1935

Asp Arg Phe Val Asp Val Ile Ala Arg Gly Gln Leu Leu Arg Ala Tyr
1940 1945 1950

Arg Thr Gly Asp Arg Ala Arg Tyr Arg Pro Lys Asp Gly Gln Val Glu
1955 1960 1965

Phe Phe Gly Arg Met Asp His Gln Val Lys Val Arg Gly His Arg Ile
1970 1975 1980

Glu Leu Ala Glu Val Glu His Ala Leu Leu Ser Ser Ala Gly Val His
1985 1990 1995 2000

Asp Ala Val Val Val Ser Asn Ser Gln Glu Asp Asn Gln Gly Val Glu
2005 2010 2015

Met Val Ala Phe Ile Thr Ala Gln Asp Asn Glu Thr Leu Gln Glu Ala
2020 2025 2030

Gln Ser Ser Asn Gln Val Gln Glu Trp Glu Ser His Phe Glu Thr Thr
2035 2040 2045

Ala Tyr Ala Asp Ile Thr Ala Ile Asp Gln Asn Thr Leu Gly Arg Asp
2050 2055 2060

Phe Thr Ser Trp Thr Ser Met Tyr Asp Gly Thr Leu Ile Asp Lys Arg
2065 2070 2075 2080

Glu Met Gln Glu Trp Leu Asp Asp Thr Met Arg Thr Phe Leu Asp Gly
2085 2090 2095

Gln Ala Ala Gly His Val Leu Glu Ile Gly Thr Gly Thr Gly Met Val
2100 2105 2110

Leu Phe Asn Leu Gly Gln Ala Gly Leu Lys Ser Tyr Ile Gly Leu Glu
2115 2120 2125

Pro Ser Gln Ser Ala Val Gln Phe Val Asn Lys Ala Ala Gln Thr Phe
2130 2135 2140

Pro Gly Leu Glu Gly Lys Ala Gln Val His Val Gly Thr Ala Met Asp
2145 2150 2155 2160

Thr Gly Arg Leu Ser Ala Leu Ser Pro Asp Leu Ile Val Ile Asn Ser
2165 2170 2175

Val Ala Gln Tyr Phe Pro Ser Arg Glu Tyr Leu Ala Glu Val Val Glu
2180 2185 2190

Ala Leu Val Arg Ile Pro Gly Val Arg Arg Ile Phe Phe Gly Asp Met
2195 2200 2205

Arg Thr Tyr Ala Thr His Lys Asp Phe Leu Val Ala Arg Ala Val His
2210 2215 2220

Thr Asn Gly Ser Lys Val Thr Arg Ser Lys Val Gln Gln Glu Val Ala
2225 2230 2235 2240

Arg Leu Glu Glu Leu Glu Glu Glu Leu Leu Val Asp Pro Ala Phe Phe
2245 2250 2255

Thr Ser Leu Lys Glu Ser Leu Ser Glu Glu Ile Glu His Val Glu Ile
2260 2265 2270

Leu Pro Lys Asn Met Lys Val Asn Asn Glu Leu Ser Ser Tyr Arg Tyr
2275 2280 2285

Gly Ala Val Leu His Ile Arg Asn His Asn Gln Asn Gln Ser Arg Ser
2290 2295 2300

Ile His Lys Ile Asn Ala Glu Ser Trp Ile Asp Phe Ala Ser Ser Gln
2305 2310 2315 2320

Met Asp Arg Gln Gly Leu Ala Arg Leu Leu Lys Glu Asn Lys Asp Ala
2325 2330 2335

Glu Ser Ile Ala Val Phe Asn Ile Pro Tyr Ser Lys Thr Ile Val Glu
2340 2345 2350

Arg His Ile Ala Lys Ser Leu Ala Asp Asp His Asp Gly Asp Asp Thr
2355 2360 2365

ft^n^^"" Ala Ala Arg Glu Lys 

^^'^ 2375 2380 

Ala Ser Gln Cys Pro Ser Leu Asp Val His Asp Leu Val Gln Leu Ala
2385 2390 2395 2400

Glu Asp Ala Gly Phe Arg Val Glu Val Ser Trp Ala Arg Gln Arg Ser
2405 2410 2415

Gln Asn Gly Ala Leu Asp Val Phe Phe His His Phe Gln Pro Thr Glu
2420 2425 2430

Asn Glu Ser Arg Ala Leu Val Asp Phe Pro Thr Asp Tyr Lys Gly Gln
2435 2440 2445

^itn^""^ ^^"^ Gin Arg Val Glu Ser Arg 

^^^^ 2455 2460 

^1%^'^^^ Y*^ ^""^ J^®" Gin Val Leu Leu Pro Ala 

2465 2470 2475 2480 

Tyr Met Ile Pro Ala Arg Ile Val Val Leu Gln Asn Met Pro Leu Asn
2485 2490 2495

Thr Ser Gly Lys Val Asp Arg Lys Glu Leu Thr Leu Arg Ala Lys Val
2500 2505 2510

Thr Ala Ala Arg Thr Pro Ser Ser Glu Leu Val Ala Pro Arg Asp Ser
2515 2520 2525

?^^n^^* Asp val Leu Gly Val Glu 

''S-'O 2535 2540 

Val Gly Ile Thr Asp Asn Phe Phe Asn Val Gly Gly His Ser Leu Leu
2545 2550 2555 2560

Ala Thr Lys Leu Ala Ala Arg Leu Ser Arg Gln Leu Asn Ala Gln Ile
2565 2570 2575

Ala Val Lys Asp Ile Phe Asp Arg Pro Val Ile Ala Asp Leu Ala Ala
2580 2585 2590

Thr Ile Gln Gln Asp Thr Thr Glu His Asn Pro Ile Leu Pro Thr Ser
2595 2600 2605

'^^'^ Isfo^^^ ""^^ tl"^.^^^ Gin Gly Arg Leu Trp Phe 

2615 2620 

Leu Asp Gln Leu Asn Val Gly Ala Thr Trp Tyr Leu Met Pro Phe Ala
2625 2630 2635 2640

Val Arg Leu Arg Gly Pro Leu Val Val Ser Ala Leu Ala Ala Ala Leu
2645 2650 2655

Leu Ala Leu Glu Glu Arg His Glu Thr Leu Arg Thr Thr Phe Ile Glu
2660 2665 2670

Gln Glu Gly Ile Gly Met Gln Val Ile His Pro Phe Ala Pro Lys Glu
2675 2680 2685

Leu Arg Val Ile Asp Val Ser Gly Glu Glu Glu Ser Thr Ile Gln Lys
2690 2695 2700

Ile Leu Glu Lys Glu Gln Thr Thr Pro Phe Asn Leu Ala Ser Glu Pro
2705 2710 2715 2720

Gly Phe Arg Leu Ala Leu Leu Lys Thr Gly Glu Asp Glu His Ile Leu
2725 2730 2735

Ser Thr Val Met His His Ala Ile Ser Asp Gly Trp Ser Val Asp Ile
2740 2745 2750

Phe Gln Gln Glu Ile Gly Gln Phe Tyr Ser Ala Ile Leu Arg Gly His
2755 2760 2765

Asp Pro Leu Ala Gln Ile Ala Pro Leu Ser Ile Gln Tyr Arg Asp Phe
2770 2775 2780

Ala Thr Trp Gln Arg Gln Ile Phe Gln Val Ala Glu His Arg Arg Gln
2785 2790 2795 2800

Leu Ala Tyr Trp Thr Lys Gln Leu Ala Asp Asn Lys Pro Ala Glu Leu
2805 2810 2815

Leu Thr Asp Phe Lys Arg Pro Pro Met Leu Ser Gly Arg Ala Gly Glu
2820 2825 2830

Ile Pro Val Val Val Asp Gly Leu Ile Tyr Glu Lys Leu Gln Asp Phe
2835 2840 2845

Cys Arg Ile Arg Gln Val Thr Ala Phe Thr Val Leu Leu Ala Ala Phe
2850 2855 2860

Arg Ala Ala His Tyr Arg Met Thr Gly Thr Glu Asp Ala Thr Ile Gly
2865 2870 2875 2880

Thr Pro Ile Ala Asn Arg Asn Arg Pro Glu Leu Glu Gly Leu Ile Gly
2885 2890 2895

Phe Phe Val Asn Thr Gln Cys Met Arg Ile Thr Val Asp Val Glu Asp
2900 2905 2910

Ser Phe Glu Thr Leu Val His Gln Val Arg Glu Thr Thr Leu Ala Ala
2915 2920 2925

His Ala Asn Gln Asp Val Pro Phe Glu Gln Ile Val Ser Asn Ile Leu
2930 2935 2940

Pro Gly Ser Ser Asp Thr Ser Arg Asn Pro Leu Val Gln Leu Met Phe
2945 2950 2955 2960

Ala Leu His Ser Gln Gln Asn Leu Gly Lys Val Arg Leu Glu Gly Ile
2965 2970 2975

Glu Glu Glu Ile Ile Ser Ile Ala Glu Thr Thr Arg Phe Asp Ile Glu
2980 2985 2990

Phe His Leu Tyr Gln Glu Ala Glu Arg Leu Asn Gly Ser Ile Val Tyr
2995 3000 3005

Ala Ala Asp Leu Phe Val Pro Glu Thr Ile Gln Ser Val Ile Thr Ile
3010 3015 3020

Phe Gln Gly Ile Leu Gln Lys Gly Leu Gly Glu Pro Asp Met Pro Val
3025 3030 3035 3040

Ala Ser Met Ala Leu Asp Gly Gly Leu Glu Ser Leu Arg Ser Thr Gly
3045 3050 3055

Leu Leu His Pro Gln Gln Thr Asp Tyr Pro Cys Asp Ala Ser Val Val
3060 3065 3070

Gln Ile Phe Lys Gln Gln Val Ala Val Asn Pro Asp Val Ile Ala Val
3075 3080 3085

Arg Asp Glu Ser Thr Arg Leu Ser Tyr Ala Asp Leu Asp Arg Lys Ser
3090 3095 3100

Asp Gln Val Ala Cys Trp Leu Ser Arg Arg Gly Ile Ala Pro Glu Thr
3105 3110 3115 3120

Phe Val Ala Ile Leu Ala Pro Arg Ser Cys Glu Thr Ile Val Ala Ile
3125 3130 3135

Leu Gly Val Leu Lys Ala Asn Leu Ala Tyr Leu Pro Leu Asp Val Asn
3140 3145 3150

Val Pro Ala Ser Arg Leu Glu Ala Ile Leu Ser Glu Val Ser Gly Ser
3155 3160 3165

Met Leu Val Leu Val Gly Ala Glu Thr Pro Ile Pro Glu Gly Met Ala
3170 3175 3180

Glu Ala Glu Thr Ile Arg Ile Thr Glu Ile Leu Ala Asp Ala Lys Thr
3185 3190 3195 3200

Asp Asp Ile Asn Gly Leu Ala Ala Ser Gln Pro Thr Ala Ala Ser Leu
3205 3210 3215

Ala Tyr Val He Phe Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val 
3220 3225 3230 

Met Val Glu His Arg Gly He Val Arg Leu Thr Lys Gin Thr Asn He 
3235 3240 3245 

Thr Ser Lys Leu Pro Glu Ser Phe His Met Ala His He Ser Asn Leu 
3250 3255 3260 

Ala Phe Asp Ala Ser Val Trp Glu Val Phe Thr Thr Leu Leu Asn Gly 
3265 3270 3275 3280 

Gly Thr Leu Val Cys He Asp Tyr Phe Thr Leu Leu Glu Ser Thr Ala 
3285 3290 3295 

Leu Glu Lys Val Phe Phe Asp Gin Arg Val Asn Val Ala Leu Leu Pro 
3300 3305 3310 

Pro Ala Leu Leu Lys Gin Cys Leu Asp Asn Ser Pro Ala Leu Val Lys 
3315 3320 3325 

Thr Leu Ser Val Leu Tyr He Gly Gly Asp Arg Leu Asp Ala Ser Asp 
3330 3335 3340 

Ala Ala Lys Ala Arg Gly Leu Val Gin Thr Gin Ala Phe Asn Ala Tyr 
3345 3350 3355 3360 

Gly Pro Thr Glu Asn Thr Val Met Ser Thr He Tyr Pro He Ala Glu 
3365 3370 3375 

Asp Pro Phe Ile Asn Gly Val Pro Ile Gly His Ala Val Ser Asn Ser
3380 3385 3390

Gly Ala Phe Val Met Asp Gln Asn Gln Gln Ile Thr Pro Pro Gly Ala
3395 3400 3405

Met Gly Glu Leu Ile Val Thr Gly Asp Gly Leu Ala Arg Gly Tyr Thr
3410 3415 3420

Thr Ser Ser Leu Asn Thr Gly Arg Phe Ile Asn Val Asp Ile Asp Gly
3425 3430 3435 3440

Glu Gln Val Arg Ala Tyr Arg Thr Gly Asp Arg Val Arg Tyr Arg Pro
3445 3450 3455

Lys Asp Leu Gln Ile Glu Phe Phe Gly Arg Ile Asp His Gln Val Lys
3460 3465 3470

Ile Arg Gly His Arg Ile Glu Pro Ala Glu Val Glu Tyr Ala Leu Leu
3475 3480 3485

Ser His Asp Leu Val Thr Asp Ala Ala Val Val Thr His Ser Gln Glu
3490 3495 3500

Asn Gln Asp Leu Glu Met Val Gly Phe Val Ala Ala Arg Val Ala Asp
3505 3510 3515 3520

Val Arg Glu Asp Glu Ser Ser Asn Gln Val Gln Glu Trp Gln Thr His
3525 3530 3535

Phe Asp Ser Ile Ala Tyr Ala Asp Ile Thr Thr Ile Asp Gln Gln Ser
3540 3545 3550

Leu Gly Arg Asp Phe Met Ser Trp Thr Ser Met Tyr Asp Gly Ser Leu
3555 3560 3565

Ile Lys Lys Ser Gln Met Gln Glu Trp Leu Asp Asp Thr Met Arg Ser
3570 3575 3580

Leu Leu Asp Ser Gln Pro Pro Gly His Val Leu Glu Val Gly Thr Gly
3585 3590 3595 3600

Thr Gly Met Val Leu Phe Asn Leu Gly Arg Glu Gly Gly Leu Gln Ser
3605 3610 3615

Tyr Val Gly Leu Glu Pro Ser Pro Ser Ala Thr Ala Phe Val Asn Lys
3620 3625 3630

Ala Ala Lys Ser Phe Pro Gly Leu Glu Asp Arg Ile Arg Val Glu Val
3635 3640 3645

Gly Thr Ala Thr Asp Ile Asp Arg Leu Gly Asp Asp Leu His Ala Gly
3650 3655 3660

Leu Val Val Val Asn Ser Val Ala Gln Tyr Phe Pro Ser Gln Asp Tyr
3665 3670 3675 3680

Leu Ala Gln Leu Val Arg Asp Leu Thr Lys Val Pro Gly Val Glu Arg
3685 3690 3695

Ile Phe Phe Gly Asp Met Arg Ser His Ala Ile Asn Arg Asp Phe Leu
3700 3705 3710

Val Ala Arg Ala Val His Ala Leu Gly Asp Lys Ala Thr Lys Ala Glu
3715 3720 3725

Ile Gln Arg Glu Val Val Arg Met Glu Glu Ser Glu Asp Glu Leu Leu
3730 3735 3740

Val Asp Pro Ala Phe Phe Thr Ser Leu Thr Thr Gln Val Glu Asn Ile
3745 3750 3755 3760

Lys His Val Glu Ile Leu Pro Lys Arg Met Arg Ala Thr Asn Glu Leu
3765 3770 3775

Ser Ser Tyr Arg Tyr Ala Ala Val Leu His Val Asn Asp Leu Ala Lys
3780 3785 3790

Pro Ala His Lys Val Ser Pro Gly Ala Trp Val Asp Phe Ala Ala Thr
3795 3800 3805

Lys Met Asp Arg Asp Ala Leu Ile Arg Leu Leu Arg Gly Thr Lys Ile
3810 3815 3820

Ser Asp His Ile Ala Ile Ala Asn Ile Pro Asn Ser Lys Thr Ile Val
3825 3830 3835 3840

Glu Arg Thr Ile Cys Glu Ser Val Tyr Asp Leu Gly Gly Asp Ala Lys
3845 3850 3855

Asp Ser Asn Asp Arg Val Ser Trp Leu Ser Ala Ala Arg Ser Asn Ala
3860 3865 3870

Val Lys Val Ala Ser Leu Ser Ala Ile Asp Leu Val Asp Ile Ala Gln
3875 3880 3885

Glu Ala Gly Phe Arg Val Glu Ile Ser Cys Ala Arg Gln Trp Ser Gln
3890 3895 3900

Asn Gly Ala Leu Asp Ala Val Phe His His Leu Gly Pro Ser Pro Gln
3905 3910 3915 3920

Ser Ser His Val Leu Ile Asp Phe Leu Thr Asp His Gln Gly Arg Pro
3925 3930 3935

Glu Glu Ala Leu Thr Asn His Pro Leu His Arg Ala Gln Ser Arg Arg
3940 3945 3950

Val Glu Arg Gln Ile Arg Glu Arg Leu Gln Thr Leu Leu Pro Ala Tyr
3955 3960 3965

Met Ile Pro Ala Gln Ile Met Val Leu Asp Lys Leu Pro Leu Asn Ala
3970 3975 3980

Asn Gly Lys Val Asp Arg Lys Gln Leu Thr Gln Arg Ala Gln Thr Val
3985 3990 3995 4000

Pro Lys Ala Lys Gln Val Ser Ala Pro Val Ala Pro Arg Thr Glu Ile
4005 4010 4015

Glu Arg Val Leu Cys Gln Glu Phe Ser Asp Val Leu Gly Val Asp Ile
4020 4025 4030

Gly Ile Met Glu Asn Phe Phe Asp Leu Gly Gly His Ser Leu Met Ala
4035 4040 4045

Thr Lys Leu Ala Ala Arg Ile Ser Arg Arg Leu Glu Thr His Val Ser
4050 4055 4060

Val Lys Glu Ile Phe Asp His Pro Arg Val Cys Asp Leu Val Leu Ile
4065 4070 4075 4080

Val Gln Gln Gly Ser Ala Pro His Asp Pro Ile Val Ser Thr Lys Tyr
4085 4090 4095

Thr Gly Pro Val Pro Gln Ser Phe Ala Gln Gly Arg Leu Trp Phe Leu
4100 4105 4110

Asp Gln Leu Asn Phe Gly Ala Thr Trp Tyr Leu Met Pro Leu Ala Val
4115 4120 4125

Arg Leu Arg Gly Ala Met Asn Val His Ala Leu Thr Ala Ala Leu Leu
4130 4135 4140

Ala Leu Glu Arg Arg His Glu Leu Leu Arg Thr Thr Phe Tyr Glu Gln
4145 4150 4155 4160

Asn Gly Val Gly Met Gln Lys Val Asn Pro Val Val Thr Glu Thr Leu
4165 4170 4175

Arg Ile Ile Asp Leu Ser Asn Gly Asp Gly Asp Tyr Leu Pro Thr Leu
4180 4185 4190

Lys Lys Glu Gln Thr Ala Pro Phe His Leu Glu Thr Glu Pro Gly Trp
4195 4200 4205

Arg Val Ala Leu Leu Arg Leu Gly Pro Gly Asp Tyr Ile Leu Ser Val
4210 4215 4220

Val Met His His Ile Ile Ser Asp Gly Trp Ser Val Asp Val Leu Phe
4225 4230 4235 4240

Gln Glu Leu Gly Gln Phe Tyr Ser Thr Ala Val Lys Gly His Asp Pro
4245 4250 4255

Leu Ser Gln Thr Thr Pro Leu Pro Ile His Tyr Arg Asp Phe Ala Leu
4260 4265 4270

Trp Gln Lys Lys Pro Thr Gln Glu Ser Glu His Glu Arg Gln Leu Gln
4275 4280 4285

Tyr Trp Val Glu Gln Leu Val Asp Ser Ala Pro Ala Glu Leu Leu Thr
4290 4295 4300

Asp Leu Pro Arg Pro Ser Ile Leu Ser Gly Gln Ala Gly Glu Met Ser
4305 4310 4315 4320

Val Thr Ile Glu Gly Ala Leu Tyr Lys Asn Leu Glu Glu Phe Cys Arg
4325 4330 4335

Val His Arg Val Thr Ser Phe Val Val Leu Leu Ala Ala Leu Arg Ala
4340 4345 4350

Ala His Tyr Arg Leu Thr Gly Ser Glu Asp Ala Thr Ile Gly Thr Pro
4355 4360 4365

Ile Ala Asn Arg Asn Arg Pro Glu Leu Glu Gln Ile Ile Gly Phe Phe
4370 4375 4380

Val Asn Thr Gln Cys Ile Arg Ile Thr Val Asn Glu Asp Glu Thr Phe
4385 4390 4395 4400

Glu Ser Leu Val Gln Gln Val Arg Ser Thr Ala Thr Ala Ala Phe Ala
4405 4410 4415

His Gln Asp Val Pro Phe Glu Lys Ile Val Ser Thr Leu Leu Pro Gly
4420 4425 4430

Ser Arg Asp Ala Ser Arg Asn Pro Leu Val Gln Leu Met Phe Ala Val
4435 4440 4445
His Ser Gln Lys Asn Leu Gly Glu Leu Lys Leu Glu Asn Ala His Ser
4450 4455 4460

Glu Val Val Pro Thr Glu Ile Thr Thr Arg Phe Asp Leu Glu Phe His
4465 4470 4475 4480

Leu Phe Gln Gln Asp Asp Lys Leu Glu Gly Ser Ile Leu Tyr Ser Thr
4485 4490 4495

Asp Leu Phe Glu Ala Val Ser Val Gln Ser Leu Leu Ser Val Phe Gln
4500 4505 4510

Glu Ile Leu Arg Arg Gly Leu Asn Gly Pro Asp Val Pro Ile Ser Thr
4515 4520 4525

Leu Pro Leu Gln Asp Gly Ile Val Asp Leu Gln Arg Gln Gly Leu Leu
4530 4535 4540

Asp Val Gln Lys Thr Glu Tyr Pro Arg Asp Ser Ser Val Val Asp Val
4545 4550 4555 4560

Phe His Glu Gln Val Ser Ile Asn Pro Asp Ser Ile Ala Leu Ile His
4565 4570 4575

Gly Ser Glu Lys Leu Ser Tyr Ala Gln Leu Asp Arg Glu Ser Asp Arg
4580 4585 4590

Val Ala Arg Trp Leu Arg His Arg Ser Phe Ser Ser Asp Thr Leu Ile
4595 4600 4605

Ala Val Leu Ala Pro Arg Ser Cys Glu Thr Ile Ile Ala Phe Leu Gly
4610 4615 4620

Ile Leu Lys Ala Asn Leu Ala Tyr Leu Pro Leu Asp Val Lys Ala Pro
4625 4630 4635 4640

Ala Ala Arg Ile Asp Ala Ile Val Ser Ser Leu Pro Gly Asn Lys Leu
4645 4650 4655

Ile Leu Leu Gly Ala Asn Val Thr Pro Pro Lys Leu Gln Glu Ala Ala
4660 4665 4670

Ile Asp Phe Val Pro Ile Arg Asp Thr Phe Thr Thr Leu Thr Asp Gly
4675 4680 4685

Thr Leu Gln Asp Gly Pro Thr Ile Glu Arg Pro Ser Ala Gln Ser Leu
4690 4695 4700

Ala Tyr Ala Met Phe Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val
4705 4710 4715 4720

Met Val Gln His Arg Asn Ile Val Arg Leu Val Lys Asn Ser Asn Val
4725 4730 4735

Val Ala Lys Gln Pro Ala Ala Ala Arg Ile Ala His Ile Ser Asn Leu
4740 4745 4750

Ala Phe Asp Ala Ser Ser Trp Glu Ile Tyr Ala Pro Leu Leu Asn Gly
4755 4760 4765

Gly Ala Ile Val Cys Ala Asp Tyr Phe Thr Thr Ile Asp Pro Gln Ala
4770 4775 4780

Leu Gln Glu Thr Phe Gln Glu His Glu Ile Arg Gly Ala Met Leu Pro
4785 4790 4795 4800

Pro Ser Leu Leu Lys Gln Cys Leu Val Gln Ala Pro Asp Met Ile Ser
4805 4810 4815

Arg Leu Asp Ile Leu Phe Ala Ala Gly Asp Arg Phe Ser Ser Val Asp
4820 4825 4830

Ala Leu Gln Ala Gln Arg Leu Val Gly Ser Gly Val Phe Asn Ala Tyr
4835 4840 4845

Gly Pro Thr Glu Asn Thr Ile Leu Ser Thr Ile Tyr Asn Val Ala Glu
4850 4855 4860

Asn Asp Ser Phe Val Asn Gly Val Pro Ile Gly Ser Ala Val Ser Asn
4865 4870 4875 4880

Ser Gly Ala Tyr Ile Met Asp Lys Asn Gln Gln Leu Val Pro Ala Gly
4885 4890 4895

Val Met Gly Glu Leu Val Val Thr Gly Asp Gly Leu Ala Arg Gly Tyr
4900 4905 4910

Met Asp Pro Lys Leu Asp Ala Asp Arg Phe Ile Gln Leu Thr Val Asn
4915 4920 4925

Gly Ser Glu Gln Val Arg Ala Tyr Arg Thr Gly Asp Arg Val Arg Tyr
4930 4935 4940

Arg Pro Lys Asp Phe Gln Ile Glu Phe Phe Gly Arg Met Asp Gln Gln
4945 4950 4955 4960

Ile Lys Ile Arg Gly His Arg Ile Glu Pro Ala Glu Val Glu Gln Ala
4965 4970 4975

Phe Leu Asn Asp Gly Phe Val Glu Asp Val Ala Ile Val Ile Arg Thr
4980 4985 4990

Pro Glu Asn Gln Glu Pro Glu Met Val Ala Phe Val Thr Ala Lys Gly
4995 5000 5005

Asp Asn Ser Ala Arg Glu Glu Glu Ala Thr Thr Gln Ile Glu Gly Trp
5010 5015 5020

Glu Ala His Phe Glu Gly Gly Ala Tyr Ala Asn Ile Glu Glu Ile Glu
5025 5030 5035 5040

Ser Glu Ala Leu Gly Tyr Asp Phe Met Gly Trp Thr Ser Met Tyr Asp
5045 5050 5055

Gly Thr Glu Ile Asp Lys Asp Glu Met Arg Glu Trp Leu Asn Asp Thr
5060 5065 5070

Met Arg Ser Leu Leu Asp Gly Lys Pro Ala Gly Arg Val Leu Glu Val
5075 5080 5085

Gly Thr Gly Thr Gly Met Ile Met Phe Asn Leu Gly Arg Ser Gln Gly
5090 5095 5100

Leu Glu Arg Tyr Ile Gly Leu Glu Pro Ala Pro Ser Ala Ala Glu Phe
5105 5110 5115 5120

Val Asn Asn Ala Ala Lys Ser Phe Pro Gly Leu Ala Gly Arg Ala Glu
5125 5130 5135

Val His Val Gly Thr Ala Ala Asp Val Gly Thr Leu Gln Gly Leu Thr
5140 5145 5150

Ser Asp Met Ala Val Ile Asn Ser Val Ala Gln Tyr Phe Pro Thr Pro
5155 5160 5165

Glu Tyr Leu Ala Glu Thr Ile Lys Ser Leu Val Gln Val Pro Gly Met
5170 5175 5180

Lys Arg Ile Tyr Leu Gly Asp Met Arg Ser Trp Ala Met Asn Arg Asp
5185 5190 5195 5200

Phe Ala Ala Ala Arg Ala Ala Tyr Ser Leu Ala Asp Asn Ala Ser Lys
5205 5210 5215

Asp Arg Val Arg Gln Lys Met Met Glu Leu Glu Glu Lys Glu Glu Glu
5220 5225 5230

Leu Leu Val Asp Pro Ala Phe Phe Thr Ala Leu Ala Ser Gln Leu Gln
5235 5240 5245

Asp Arg Ile Gln His Val Glu Ile Leu Pro Lys Arg Met Lys Ala Thr
5250 5255 5260

Asn Glu Leu Ser Ser Tyr Arg Tyr Ala Ala Val Leu His Ile Ser Asp
5265 5270 5275 5280

Glu Pro Leu Pro Ile Tyr Lys Ile Asp Pro Glu Ala Trp Ile Asn Phe
5285 5290 5295

Glu Gly Ser Arg Leu Thr Arg Glu Ala Leu Ala Gln Val Leu Lys Glu
5300 5305 5310

Asn Glu Asn Ala Glu Ser Val Ala Ile Ser Asn Ile Pro Tyr Ser Lys
5315 5320 5325

Thr Val Val Glu Arg His Ile Val Arg Ser Leu Asp Gln Glu Asp Ala
5330 5335 5340

Asn Ala Pro Glu Glu Ser Met Asp Gly Ser Asp Trp Ile Ser Ala Val
5345 5350 5355 5360

Arg Thr Arg Ala Gln Gln Cys His Thr Leu Ser Ala Ser Asp Leu Phe
5365 5370 5375

Asp Ile Ala Glu Asp Ala Gly Phe Arg Val Glu Val Ser Trp Ala Arg
5380 5385 5390

Gln His Ser Gln His Gly Ala Leu Asp Ala Val Phe His His Leu Lys
5395 5400 5405

Pro Ala Thr Glu Asp Ser Arg Val Leu Ile Lys Phe Pro Thr Asp His
5410 5415 5420

Gln Gly Arg Pro Leu Lys Ser Leu Thr Asn Gln Pro Leu Leu Pro Ala
5425 5430 5435 5440

Gln Ser Arg Arg Ala Glu Leu Leu Ile Arg Glu Gly Leu Gln Thr Leu
5445 5450 5455

Leu Pro Pro Tyr Met Ile Pro Ser Gln Ile Thr Leu Ile Asp Arg Met
5460 5465 5470

Pro Leu Asn Ala Asn Gly Lys Val Asp Arg Arg Glu Leu Ala Arg Arg
5475 5480 5485

Ala Lys Ile Thr Gln Lys Ser Lys Pro Val Glu Asp Ile Val Pro Pro
5490 5495 5500

Arg Asn Ser Val Glu Ala Thr Val Cys Lys Gly Phe Thr Asp Val Leu
5505 5510 5515 5520

Gly Val Glu Val Gly Ile Thr Asp Asn Phe Phe Asn Leu Gly Gly His
5525 5530 5535 

Ser Leu Met Ala Thr Lys Leu Ala Ala Arg Leu Gly Arg Gln Leu Asn
5540 5545 5550

Thr Arg Ile Ser Val Arg Asp Val Phe Asp Gln Pro Val Val Ala Asp
5555 5560 5565

Leu Ala Ala Val Ile Gln Arg Asn Ser Ala Pro His Glu Pro Ile Lys
5570 5575 5580

Pro Ala Asp Tyr Thr Gly Pro Val Pro Gln Ser Phe Ala Gln Gly Arg
5585 5590 5595 5600

Leu Trp Phe Leu Asp Gln Leu Asn Val Gly Ala Thr Trp Tyr Leu Met
5605 5610 5615

Pro Leu Gly Ile Arg Leu His Gly Ser Leu Arg Val Asp Ala Leu Ala
5620 5625 5630

Thr Ala Ile Ser Ala Leu Glu Gln Arg His Glu Pro Leu Arg Thr Thr
5635 5640 5645

Phe His Glu Glu Asp Gly Val Gly Val Gln Val Val Gln Asp His Arg
5650 5655 5660

Pro Lys Asp Leu Arg Ile Ile Asp Leu Ser Thr Gln Pro Lys Asp Ala
5665 5670 5675 5680

Tyr Leu Ala Val Leu Lys His Glu Gln Thr Thr Leu Phe Asp Leu Ala
5685 5690 5695

Thr Glu Pro Gly Trp Arg Val Ala Leu Ile Arg Leu Gly Glu Glu Glu
5700 5705 5710

His Ile Leu Ser Ile Val Met His His Ile Ile Ser Asp Gly Trp Ser
5715 5720 5725

Val Glu Val Leu Phe Asp Glu Met His Arg Phe Tyr Ser Ser Ala Leu
5730 5735 5740

Arg Gln Gln Asp Pro Met Glu Gln Ile Leu Pro Leu Pro Ile Gln Tyr
5745 5750 5755 5760

Arg Asp Phe Ala Ala Trp Gln Lys Thr Glu Glu Gln Val Ala Glu His
5765 5770 5775

Gln Arg Gln Leu Asp Tyr Trp Thr Glu His Leu Ala Asp Ser Thr Pro
5780 5785 5790

Ala Glu Leu Leu Thr Asp Leu Pro Arg Pro Ser Ile Leu Ser Gly Arg
5795 5800 5805

Ala Asn Glu Leu Pro Leu Thr Ile Glu Gly Arg Leu His Asp Lys Leu
5810 5815 5820

Arg Ala Phe Cys Arg Val His Gln Ala Thr Pro Phe Val Ile Leu Leu
5825 5830 5835 5840

Ala Ala Leu Arg Ala Ala His Tyr Arg Leu Thr Gly Ala Glu Asp Ala
5845 5850 5855

Thr Leu Gly Thr Pro Ile Ala Asn Arg Asn Arg Pro Glu Leu Glu Asn
5860 5865 5870

Met Ile Gly Phe Phe Val Asn Thr Gln Cys Met Arg Ile Ala Ile Glu
5875 5880 5885

Glu Asn Asp Asn Phe Glu Ser Leu Val Arg Arg Val Arg Ser Thr Ala
5890 5895 5900

Thr Ser Ala Phe Ala Asn Gln Asp Val Pro Phe Glu Ser Ile Val Ser
5905 5910 5915 5920

Ser Leu Leu Pro Gly Ser Arg Asp Ala Ser Arg Asn Pro Leu Val Gln
5925 5930 5935

Val Ile Leu Ala Val His Ser Gln Gln Asp Leu Gly Lys Leu Thr Leu
5940 5945 5950

Glu Gly Leu Arg Asp Glu Ala Val Asp Ser Ala Ile Ser Thr Arg Phe
5955 5960 5965

Asp Val Glu Phe His Leu Phe Glu His Ala Asp Arg Leu Ser Gly Ser
5970 5975 5980

Val Leu Tyr Ala Lys Glu Leu Phe Lys Leu Arg Thr Ile Glu Ser Val
5985 5990 5995 6000

Val Ser Val Phe Leu Glu Thr Leu Arg Arg Ala Leu Asp Gln Pro Leu
6005 6010 6015

Thr Pro Leu Ala Val Leu Pro Leu Thr Asp Gly Val Gly Glu Ile Ala
6020 6025 6030

Ser Lys Gly Leu Leu Asp Val Pro Arg Thr Asp Tyr Pro Arg Asp Ala
6035 6040 6045

Asn Ile Val Glu Val Phe Gln Gln His Val Arg Ala Thr Pro Asp Ala
6050 6055 6060

Ile Ala Val Lys Asp Ala Thr Ser Ile Leu Thr Tyr Ala Gln Leu Asp
6065 6070 6075 6080

Gln Gln Ser Asp Arg Leu Ala Ile Trp Leu Ser Arg Arg His Met Met
6085 6090 6095

Pro Glu Thr Leu Val Gly Val Leu Ala Pro Arg Ser Cys Glu Thr Ile
6100 6105 6110

Ile Ala Met Phe Gly Ile Met Lys Ala Asn Leu Ala Tyr Leu Pro Leu
6115 6120 6125

Asp Ile Asn Ser Pro Ala Ala Arg Leu Arg Ser Ile Leu Ser Ala Val
6130 6135 6140

Asp Gly Asn Lys Leu Val Leu Leu Gly Ser Gly Val Thr Ala Pro Glu
6145 6150 6155 6160

Gln Glu Asn Pro Glu Val Glu Ala Val Gly Ile Gln Glu Ile Leu Ala
6165 6170 6175

Gly Thr Gly Leu Asp Lys Thr Gln Gly Ser Asn Ala Arg Pro Ser Ala
6180 6185 6190

Thr Ser Leu Ala Tyr Val Ile Phe Thr Ser Gly Ser Thr Gly Lys Pro
6195 6200 6205

Lys Gly Val Met Val Glu His Arg Ser Val Thr Arg Leu Ala Lys Pro
6210 6215 6220

Ser Asn Val Ile Ser Lys Leu Pro Gln Gly Ala Arg Val Ala His Leu
6225 6230 6235 6240

Ala Asn Ile Ala Phe Asp Ala Ser Ile Trp Glu Ile Ala Thr Thr Leu
6245 6250 6255

Leu Asn Gly Ala Thr Leu Val Cys Leu Asp Tyr His Thr Val Leu Asp
6260 6265 6270

Cys Arg Thr Leu Lys Glu Val Phe Glu Arg Glu Ser Ile Thr Val Val
6275 6280 6285

Thr Leu Met Pro Ala Leu Leu Lys Gln Cys Val Ala Glu Ile Pro Glu
6290 6295 6300

Thr Leu Ala His Leu Asp Leu Leu Tyr Thr Gly Gly Asp Arg Val Gly
6305 6310 6315 6320

Gly His Asp Ala Met Arg Ala Arg Ser Leu Val Lys Ile Gly Met Phe
6325 6330 6335

Ser Gly Tyr Gly Pro Thr Glu Asn Thr Val Ile Ser Thr Ile Tyr Glu
6340 6345 6350

Val Asp Ala Asp Glu Met Phe Val Asn Gly Val Pro Ile Gly Lys Thr
6355 6360 6365

Val Ser Asn Ser Gly Ala Tyr Val Met Asp Arg Asn Gln Gln Leu Val
6370 6375 6380

Pro Ser Gly Val Val Gly Glu Leu Val Val Thr Gly Asp Gly Leu Ala
6385 6390 6395 6400

Arg Gly Tyr Thr Asp Pro Ser Leu Asn Lys Asn Arg Phe Ile Tyr Ile
6405 6410 6415

Thr Val Asn Gly Glu Ser Ile Arg Ala Tyr Arg Thr Gly Asp Arg Val
6420 6425 6430

Arg Tyr Arg Pro His Asp Leu Gln Ile Glu Phe Phe Gly Arg Met Asp
6435 6440 6445

Gln Gln Val Lys Ile Arg Gly His Arg Ile Glu Pro Gly Glu Val Glu
6450 6455 6460

Ser Ala Leu Leu Ser His Asn Ser Val Gln Asp Ala Ala Val Val Ile
6465 6470 6475 6480

Cys Ala Pro Ala Asp Gln Asp Ser Gly Ala Glu Met Val Ala Phe Val
6485 6490 6495

Ala Ala Arg Asn Thr Glu Asp Glu Asp Thr Gln Glu Glu Glu Ala Val
6500 6505 6510

Asp Gln Val Gln Gly Trp Glu Thr His Phe Glu Thr Ala Ala Tyr Ser
6515 6520 6525

Glu Val Lys Asp Ile Arg Gln Ser Glu Val Gly Asn Asp Phe Met Gly
6530 6535 6540

Trp Thr Ser Met Tyr Asp Gly Ser Glu Ile Asp Lys Thr Asp Met His
6545 6550 6555 6560

Glu Trp Leu Asn Asp Thr Met Arg Met Ile Leu Asp Ala Arg Glu Pro
6565 6570 6575

Gly His Val Leu Glu Ile Gly Thr Gly Thr Gly Met Val Met Phe Asn
6580 6585 6590
Leu Ala Lys Cys Pro Gly Leu Gln Gly Tyr Val Gly Phe Glu Pro Ser
6595 6600 6605

Lys Ser Ala Ala Gln Phe Val Asn Asp Ala Ala Gln Ser Phe Pro Ala
6610 6615 6620

Leu Lys Asp Gly Arg Ser Ile Val His Val Gly Thr Ala Thr Asp Ile
6625 6630 6635 6640

Asn Lys Ala Gly Pro Ile Gln Pro Arg Leu Val Val Ile Asn Ser Val
6645 6650 6655

Ala Gln Tyr Phe Pro Thr Pro Glu Tyr Leu Phe Arg Val Val Glu Ala
6660 6665 6670

Leu Val Gln Ile Pro Ser Val Glu Arg Ile Val Phe Gly Asp Met Arg
6675 6680 6685

Thr Asn Ala Ile Asn Arg Asp Phe Val Ala Ser Arg Ala Leu His Thr
6690 6695 6700

Leu Gly Glu Lys Ala Asn Lys Arg Leu Val Arg Gln Met Ile Tyr Glu
6705 6710 6715 6720

Leu Glu Ala Asn Glu Glu Glu Leu Leu Thr Asp Pro Ala Phe Phe Thr
6725 6730 6735

Ser Leu Arg Thr Arg Leu Gly Glu Lys Ile Lys His Val Glu Ile Leu
6740 6745 6750

Pro Lys Thr Met Lys Ala Thr Asn Glu Leu Ser Lys Tyr Arg Tyr Ala
6755 6760 6765

Ala Val Leu His Val Arg Gly Ser Arg Glu Gln Ser Thr Ile His Gln
6770 6775 6780

Val Ser Pro Asn Ala Trp Ile Asp Phe Ala Ala Asp Gly Leu Asp Arg
6785 6790 6795 6800

Gln Thr Leu Ile Asn Leu Leu Lys Glu His Lys Asp Ala Gly Thr Val
6805 6810 6815

Ala Ile Gly Asn Ile Pro Tyr Ser Lys Thr Ile Val Glu Arg Phe Val
6820 6825 6830

Asn Lys Ser Leu Ser Glu Asp Asp Met Glu Glu Gly Gln Asn Ser Leu
6835 6840 6845

Asp Gly Ser Ala Trp Val Ala Ala Val Arg Met Ala Ala Gln Ser Cys
6850 6855 6860

Pro Ser Leu Asp Ala Met Asp Val Lys Glu Ile Ala Gln Glu Ala Gly
6865 6870 6875 6880

Tyr Gln Val Glu Val Ser Trp Ala Arg Gln Trp Ser Gln Asn Gly Ala
6885 6890 6895

Leu Asp Ala Ile Phe His His Phe Glu Pro Pro Lys Glu Gly Ala Arg
6900 6905 6910

Thr Leu Ile Glu Phe Pro Thr Asp Tyr Glu Gly Arg Asn Val Asn Thr
6915 6920 6925

Leu Thr Asn Arg Pro Leu Asn Ser Ile Gln Ser Arg Arg Leu Gly Thr
6930 6935 6940

Gln Ile Arg Glu Lys Leu Gln Thr Leu Leu Pro Pro Tyr Met Ile Pro
6945 6950 6955 6960

Ser Arg Ile Met Val Leu Asp Gln Met Pro Val Asn Asn Asn Gly Lys
6965 6970 6975

Ile Asp Arg Lys Glu Leu Val Arg Arg Ala Ile Val Ala Pro Lys Pro
6980 6985 6990

Arg Ser Ala Ala Thr Arg Val Ala Pro Arg Asn Glu Ile Glu Ala Ile
6995 7000 7005

Leu Arg Asp Glu Phe Glu Asp Val Leu Gly Thr Glu Val Ser Val Leu
7010 7015 7020

Asp Asn Phe Phe Asp Leu Gly Gly His Ser Leu Met Ala Thr Lys Leu
7025 7030 7035 7040

Ala Ala Arg Val Ser Arg Arg Leu Asp Ala His Ile Ser Ile Lys Asp
7045 7050 7055

Val Phe Asp Gln Pro Val Leu Ala Asp Leu Ala Ala Ser Ile Gln Arg
7060 7065 7070

Glu Ser Ala Pro His Glu Pro Ile Pro Gln Arg Pro Tyr Thr Gly Pro
7075 7080 7085

Ala Glu Gln Ser Phe Ala Gln Gly Arg Leu Trp Phe Leu Asp Gln Leu
7090 7095 7100

Asn Leu Gly Ala Thr Trp Tyr Leu Met Pro Leu Ala Ile Arg Ile Arg
7105 7110 7115 7120

Gly Gln Leu Arg Val Ala Ala Leu Ser Ala Ala Leu Phe Ala Leu Glu
7125 7130 7135

Arg Arg His Glu Thr Leu Arg Thr Thr Phe Glu Glu Ser Asp Gly Val
7140 7145 7150

Gly Val Gln Ile Val Gly Glu Ala Arg Asn Ser Asp Leu Arg Val His
7155 7160 7165

Asp Val Ser Thr Gly Asp Asp Gly Glu Tyr Leu Glu Val Leu Arg Arg
7170 7175 7180

Glu Gln Thr Val Pro Phe Asp Leu Ser Ser Glu Pro Gly Trp Arg Val
7185 7190 7195 7200

Cys Leu Val Lys Thr Gly Glu Glu Asp His Val Leu Ser Ile Val Met
7205 7210 7215

His His Ile Ile Tyr Asp Gly Trp Ser Val Asp Ile Leu Arg Gly Glu
7220 7225 7230

Leu Gly Gln Phe Tyr Ser Ala Ala Leu Arg Gly Gln Asp Pro Leu Leu
7235 7240 7245

His Ala Asn Pro Leu Pro Ile Gln Tyr Arg Asp Phe Ala Ala Trp Gln
7250 7255 7260

Arg Glu Ala Lys Gln Val Glu Glu His Gln Arg Gln Leu Gly Tyr Trp
7265 7270 7275 7280

Ser Lys Gln Leu Val Asp Ser Thr Pro Ala Glu Leu Leu Thr Asp Leu
7285 7290 7295

Pro Arg Pro Ser Ile Leu Ser Gly Arg Ala Gly Ser Val Asp Val Thr
7300 7305 7310

Ile Glu Gly Ser Val Tyr Gly Ala Leu Gln Ser Phe Cys Arg Thr Arg
7315 7320 7325

Ser Val Thr Thr Phe Val Val Leu Leu Thr Val Phe Arg Ile Ala His
7330 7335 7340

Phe Arg Leu Thr Ala Val Asp Asp Ala Thr Ile Gly Thr Pro Ile Ala
7345 7350 7355 7360

Asn Arg Asn Arg Pro Glu Leu Glu Thr Leu Val Gly Cys Phe Val Asn
7365 7370 7375

Thr Gln Cys Met Arg Ile Ser Ile Ala Asp Asp Asp Asn Phe Glu Gly
7380 7385 7390

Leu Val Arg Gln Val Arg Asn Val Ala Thr Ala Ala Tyr Ala Asn Gln
7395 7400 7405

Asp Val Pro Phe Glu Arg Ile Val Ser Ala Leu Val Pro Gly Ser Arg
7410 7415 7420

Asn Thr Ser Arg Asn Pro Leu Val Gln Leu Met Phe Ala Val Gln Ser
7425 7430 7435 7440

Val Glu Asp Tyr Asp Gln Val Arg Leu Glu Gly Leu Glu Ser Val Met
7445 7450 7455

Met Pro Gly Glu Ala Ser Thr Arg Phe Asp Met Glu Phe His Leu Val
7460 7465 7470

Pro Gly Asp Gln Lys Leu Thr Gly Ser Val Leu Tyr Ser Ser Asp Leu
7475 7480 7485

Phe Glu Gln Gly Thr Ile Gln Asn Phe Val Asp Ile Phe Gln Glu Cys
7490 7495 7500

Leu Arg Ser Val Leu Asp Gln Pro Leu Thr Pro Ile Ser Val Leu Pro
7505 7510 7515 7520

Phe Ser Asn Ala Ile Ser Asn Leu Glu Ser Leu Asp Leu Leu Glu Met
7525 7530 7535

Pro Thr Ser Asp Tyr Pro Arg Asp Arg Thr Val Val Asp Leu Phe Arg
7540 7545 7550

Glu Gln Ala Ala Ile Cys Pro Asp Ser Ile Ala Val Lys Asp Ser Ser
7555 7560 7565

Ser Gln Leu Thr Tyr Ala Gln Leu Asp Glu Gln Ser Asp Arg Val Ala
7570 7575 7580

Ala Trp Leu His Glu Arg His Met Pro Ala Glu Ser Leu Val Gly Val
7585 7590 7595 7600

Leu Ser Pro Arg Ser Cys Glu Thr Ile Ile Ala Tyr Phe Gly Ile Met
7605 7610 7615

Lys Ala Asn Leu Ala Tyr Leu Pro Leu Asp Val Tyr Ala Pro Asp Ala
7620 7625 7630

Arg Leu Ala Ala Ile Leu Asp Thr Val Glu Gly Glu Arg Leu Leu Leu
7635 7640 7645

Leu Gly Ala Gly Val Pro Gln Pro Gly Ile Gln Ile Pro Arg Leu Ser
7650 7655 7660

Thr Ala Tyr Ile Ala Glu Ala Leu Ser His Ala Thr Thr Val Asp Val
7665 7670 7675 7680

Thr Ser Ile Pro Gln Pro Ser Ala Thr Ser Leu Ala Tyr Val Ile Phe
7685 7690 7695

Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Val Met Ile Glu His Arg
7700 7705 7710

Gly Ile Val Arg Leu Val Arg Asp Thr Asn Val Asn Val Phe Pro Glu
7715 7720 7725

Ser Gly Ser Ala Leu Pro Val Ser His Phe Ser Asn Leu Ala Trp Asp
7730 7735 7740

Ala Ala Thr Trp Glu Ile Tyr Thr Ala Val Leu Asn Gly Gly Thr Val
7745 7750 7755 7760

Val Cys Ile Asp Arg Asp Thr Met Leu Asp Ile Ala Ala Leu Asn Ser
7765 7770 7775

Thr Phe Arg Lys Glu Asn Val Arg Ala Ala Phe Phe Thr Pro Ala Phe
7780 7785 7790

Leu Lys Gln Cys Leu Ala Glu Thr Pro Glu Leu Val Ala Asn Leu Glu
7795 7800 7805

Ile Leu His Thr Ala Gly Asp Arg Leu Asp Pro Gly Asp Ala Asn Leu
7810 7815 7820

Ala Gly Lys Thr Ala Lys Gly Gly Ile Phe Asn Val Leu Gly His Thr
7825 7830 7835 7840

Glu Asn Thr Ala Tyr Ser Thr Phe Tyr Pro Val Val Gly Glu Glu Thr
7845 7850 7855

Phe Val Asn Gly Val Pro Val Gly Arg Gly Ile Ser Asn Ser His Ala
7860 7865 7870

Tyr Ile Ile Asp Arg His Gln Lys Leu Val Pro Ala Gly Val Met Gly
7875 7880 7885

Glu Leu Ile Leu Thr Gly Asp Gly Val Ala Arg Gly Tyr Thr Asp Ser
7890 7895 7900

Ala Leu Asn Lys Asp Arg Phe Val Tyr Ile Asp Ile Asn Gly Lys Ser
7905 7910 7915 7920

Thr Trp Ser Tyr Arg Thr Gly Asp Lys Ala Arg Tyr Arg Pro Arg Asp
7925 7930 7935

Gly Gln Leu Glu Phe Phe Gly Arg Met Asp Gln Met Val Lys Ile Arg
7940 7945 7950

Gly Val Arg Ile Glu Pro Gly Glu Val Glu Leu Thr Leu Leu Asp His
7955 7960 7965

Lys Ser Val Leu Ala Ala Thr Val Val Val Arg Arg Pro Pro Asn Gly
7970 7975 7980

Asp Pro Glu Met Ile Ala Phe Ile Thr Ile Asp Ala Glu Asp Asp Val
7985 7990 7995 8000

Gln Thr His Lys Ala Ile Tyr Lys His Leu Gln Gly Ile Leu Pro Ala
8005 8010 8015

Tyr Met Ile Pro Ser His Leu Val Ile Leu Asp Gln Met Pro Val Thr
8020 8025 8030

Asp Asn Gly Lys Val Asp Arg Lys Asp Leu Ala Leu Arg Ala Gln Thr
8035 8040 8045

Val Gln Lys Arg Arg Ser Thr Ala Ala Arg Val Pro Pro Arg Asp Glu
8050 8055 8060

Val Glu Ala Val Leu Cys Glu Glu Tyr Ser Asn Leu Leu Glu Val Glu
8065 8070 8075 8080

Val Gly Ile Thr Asp Gly Phe Phe Asp Leu Gly Gly His Ser Leu Leu
8085 8090 8095

Ala Thr Lys Leu Ala Ala Arg Leu Ser Arg Gln Leu Asn Thr Arg Val
8100 8105 8110

Ser Val Lys Asp Val Phe Asp Gln Pro Ile Leu Ala Asp Leu Ala Asp
8115 8120 8125

Ile Ile Arg Arg Gly Ser His Arg His Asp Pro Ile Pro Ala Thr Pro
8130 8135 8140

Tyr Thr Gly Pro Val Glu Gln Ser Phe Ala Gln Gly Arg Leu Trp Phe
8145 8150 8155 8160

Leu Glu Gln Leu Asn Leu Gly Ala Ser Trp Tyr Leu Met Pro Phe Ala
8165 8170 8175

Ile Arg Met Arg Gly Pro Leu Gln Thr Lys Ala Leu Ala Val Ala Leu
8180 8185 8190

Asn Ala Leu Val His Arg His Glu Ala Leu Arg Thr Thr Phe Glu Asp
8195 8200 8205

His Asp Gly Val Gly Val Gln Val Ile Gln Pro Lys Ser Ser Gln Asp
8210 8215 8220

Leu Arg Ile Ile Asp Leu Ser Asp Ala Val Asp Asp Thr Ala Tyr Leu
8225 8230 8235 8240

Ala Ala Leu Lys Arg Glu Gln Thr Thr Ala Phe Asp Leu Thr Ser Glu
8245 8250 8255

Pro Gly Trp Arg Val Ser Leu Leu Arg Leu Gly Asp Asp Asp Tyr Ile
8260 8265 8270

Leu Ser Ile Val Met His His Ile Ile Ser Asp Gly Trp Thr Val Asp
8275 8280 8285

Val Leu Arg Gln Glu Leu Gly Gln Phe Tyr Ser Ala Ala Ile Arg Gly
8290 8295 8300

Gln Glu Pro Leu Ser Gln Ala Lys Ser Leu Pro Ile Gln Tyr Arg Asp
8305 8310 8315 8320

Phe Ala Val Trp Gln Arg Gln Glu Asn Gln Ile Lys Glu Gln Ala Lys
8325 8330 8335

Gln Leu Lys Tyr Trp Ser Gln Gln Leu Ala Asp Ser Thr Pro Cys Glu
8340 8345 8350

Phe Leu Thr Asp Leu Pro Arg Pro Ser Ile Leu Ser Gly Glu Ala Asp
8355 8360 8365

Ala Val Pro Met Val Ile Asp Gly Thr Val Tyr Gln Leu Leu Thr Asp
8370 8375 8380



Phe Cys Arg Thr His Gln Val Thr Ser Phe Ser Val Leu Leu Ala Ala
8385 8390 8395 8400

Phe Arg Thr Ala His Tyr Arg Leu Thr Gly Thr Leu Asp Ala Thr Val
8405 8410 8415

Gly Thr Pro Ile Ala Asn Arg Asn Arg Pro Glu Leu Glu Gly Leu Ile
8420 8425 8430

Gly Phe Phe Val Asn Thr Gln Cys Met Arg Met Ala Ile Ser Glu Thr
8435 8440 8445

Glu Thr Phe Glu Ser Leu Val Gln Gln Val Arg Leu Thr Thr Thr Glu
8450 8455 8460

Ala Phe Ala Asn Gln Asp Val Pro Phe Glu Gln Ile Val Ser Thr Leu
8465 8470 8475 8480

Leu Pro Gly Ser Arg Asp Thr Ser Arg Asn Pro Leu Val Gln Val Met
8485 8490 8495

Phe Ala Leu Gln Ser Gln Gln Asp Leu Gly Arg Ile Gln Leu Glu Gly
8500 8505 8510

Met Thr Asp Glu Ala Leu Glu Thr Pro Leu Ser Thr Arg Leu Asp Leu
8515 8520 8525

Glu Val His Leu Phe Gln Glu Val Gly Lys Leu Ser Gly Ser Leu Leu
8530 8535 8540

Tyr Ser Thr Asp Leu Phe Glu Val Glu Thr Ile Arg Gly Ile Val Asp
8545 8550 8555 8560

Val Phe Leu Glu Ile Leu Arg Arg Gly Leu Glu Gln Pro Lys Gln Arg
8565 8570 8575

Leu Met Ala Met Pro Ile Thr Asp Gly Ile Thr Lys Leu Arg Asp Gln
8580 8585 8590

Gly Leu Leu Thr Val Ala Lys Pro Ala Tyr Pro Arg Glu Ser Ser Val
8595 8600 8605

Ile Asp Leu Phe Arg Gln Gln Val Ala Ala Ala Pro Asp Ala Ile Ala
8610 8615 8620

Val Trp Asp Ser Ser Ser Thr Leu Thr Tyr Ala Asp Leu Asp Gly Gln
8625 8630 8635 8640

Ser Asn Lys Leu Ala His Trp Leu Cys Gln Arg Asn Met Ala Pro Glu
8645 8650 8655

Thr Leu Val Ala Val Phe Ala Pro Arg Ser Cys Leu Thr Ile Val Ala
8660 8665 8670

Phe Leu Gly Val Leu Lys Ala Asn Leu Ala Tyr Leu Pro Leu Asp Val
8675 8680 8685

Asn Ala Pro Ala Ala Arg Ile Glu Ala Ile Leu Ser Ala Val Pro Gly
8690 8695 8700

His Lys Leu Val Leu Val Gln Ala His Gly Pro Glu Leu Gly Leu Thr
8705 8710 8715 8720

Met Ala Asp Thr Glu Leu Val Gln Ile Asp Glu Ala Leu Ala Ser Ser
8725 8730 8735
Ser Ser Gly Asp His Glu Gln Ile His Ala Ser Gly Pro Thr Ala Thr
8740 8745 8750

Ser Leu Ala Tyr Val Met Phe Thr Ser Gly Ser Thr Gly Lys Pro Lys
8755 8760 8765

Gly Val Met Ile Asp His Arg Ser Ile Ile Arg Leu Val Lys Asn Ser
8770 8775 8780

Asp Val Val Ala Thr Leu Pro Thr Pro Val Arg Met Ala Asn Val Ser
8785 8790 8795 8800

Asn Leu Ala Phe Asp Ile Ser Val Gln Glu Ile Tyr Thr Ala Leu Leu
8805 8810 8815

Asn Gly Gly Thr Leu Val Cys Leu Asp Tyr Leu Thr Leu Leu Asp Ser
8820 8825 8830

Lys Ile Leu Tyr Asn Val Phe Val Glu Ala Gln Val Asn Ala Ala Met
8835 8840 8845

Phe Thr Pro Val Leu Leu Lys Gln Cys Leu Gly Asn Met Pro Ala Ile
8850 8855 8860

Ile Ser Arg Leu Ser Val Leu Phe Asn Val Gly Asp Arg Leu Asp Ala
8865 8870 8875 8880

His Asp Ala Val Ala Ala Ser Gly Leu Ile Gln Asp Ala Val Tyr Asn
8885 8890 8895

Ala Tyr Gly Pro Thr Glu Asn Gly Met Gln Ser Thr Met Tyr Lys Val
8900 8905 8910

Asp Val Asn Glu Pro Phe Val Asn Gly Val Pro Ile Gly Arg Ser Ile
8915 8920 8925

Thr Asn Ser Gly Ala Tyr Val Met Asp Gly Asn Gln Gln Leu Val Ser
8930 8935 8940

Pro Gly Val Met Gly Glu Ile Val Val Thr Gly Asp Gly Leu Ala Arg
8945 8950 8955 8960

Gly Tyr Thr Asp Ser Ala Leu Asp Glu Asp Arg Phe Val His Val Thr
8965 8970 8975

Ile Asp Gly Glu Glu Asn Ile Lys Ala Tyr Arg Thr Gly Asp Arg Val
8980 8985 8990

Arg Tyr Arg Pro Lys Asp Phe Glu Ile Glu Phe Phe Gly Arg Met Asp
8995 9000 9005

Gln Gln Val Lys Ile Arg Gly His Arg Ile Glu Pro Ala Glu Val Glu
9010 9015 9020

His Ala Leu Leu Gly His Asp Leu Val His Asp Ala Ala Val Val Leu
9025 9030 9035 9040

Arg Lys Pro Ala Asn Gln Glu Pro Glu Met Ile Ala Phe Ile Thr Ser
9045 9050 9055

Gln Glu Asp Glu Thr Ile Glu Gln His Glu Ser Asn Lys Gln Val Gln
9060 9065 9070

Gly Trp Gly Glu His Phe Asp Val Ser Arg Tyr Ala Asp Ile Lys Asp
9075 9080 9085

Leu Asp Thr Ser Thr Phe Gly His Asp Phe Leu Gly Trp Thr Ser Met
9090 9095 9100



Tyr Asp Gly Val Asp Ile Pro Val Asn Glu Met Lys Glu Trp Leu Asp
9105 9110 9115 9120

Glu Thr Thr Ala Ser Leu Leu Asp Asn Arg Pro Pro Gly His Ile Leu
9125 9130 9135

Glu Ile Gly Ala Gly Thr Gly Met Ile Leu Ser Asn Leu Gly Lys Val
9140 9145 9150

Asp Gly Leu Gln Lys Tyr Val Gly Leu Asp Pro Ala Pro Ser Ala Ala
9155 9160 9165

Ile Phe Val Asn Glu Ala Val Lys Ser Leu Pro Ser Leu Ala Gly Lys
9170 9175 9180

Ala Arg Val Leu Val Gly Thr Ala Leu Asp Ile Gly Ser Leu Asp Lys
9185 9190 9195 9200

Asn Glu Ile Gln Pro Glu Leu Val Val Ile Asn Ser Val Ala Gln Tyr
9205 9210 9215

Phe Pro Thr Ser Glu Tyr Leu Ile Lys Val Val Lys Ala Val Val Glu
9220 9225 9230

Val Pro Ser Val Lys Arg Val Phe Phe Gly Asp Ile Arg Ser Gln Ala
9235 9240 9245

Leu Asn Arg Asp Phe Leu Ala Ala Arg Ala Val Arg Ala Leu Gly Asp
9250 9255 9260

Asn Ala Ser Lys Glu Gln Ile Arg Glu Lys Ile Ala Glu Leu Glu Glu
9265 9270 9275 9280 
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Ser Glu Glu Glu Leu Leu Val Asp Pro Ala Phe Phe Val Ser Leu Arg 
9285 9290 9295 

Ser Gin Leu Pro Asn He Lys His Val Glu Val Leu Pro Lys Leu Met 
9300 9305 9310 

Lys Ala Thr Asn Glu Leu Ser Ser Tyr Arg Tyr Ala Ala Val Leu His 
9315 9320 9325 

He Ser His Asn Glu Glu Glu Gin Leu Leu He Gin Asp He Asp Pro 
9330 9335 9340 

Thr Ala Trp Val Asp Phe Ala Ala Thr Gin Lys Asp Ser Gin Gly Leu 
9345 9350 9355 9360 

Arg Asn Leu Leu Gln Gln Gly Arg Asp Asp Val Met Ile Ala Val Gly
9365 9370 9375

Asn Ile Pro Tyr Ser Lys Thr Ile Val Glu Arg His Ile Met Asn Ser
9380 9385 9390

Leu Asp Gln Asp His Val Asn Ser Leu Asp Gly Thr Ser Trp Ile Ser
9395 9400 9405

Asp Ala Arg Ser Ala Ala Ala Ile Cys Thr Ser Phe Asp Ala Pro Ala
9410 9415 9420

Leu Thr Gln Leu Ala Lys Glu Glu Gly Phe Arg Val Glu Leu Ser Trp
9425 9430 9435 9440

Ala Arg Gln Arg Ser Gln Asn Gly Ala Leu Asp Ala Val Phe His Arg
9445 9450 9455

Leu Ala Thr Asp Ala Asn Cys Glu Arg Ser Arg Val Leu Val His Phe
9460 9465 9470

Pro Thr Asp His Gln Gly Arg Gln Leu Arg Thr Leu Thr Asn Arg Pro
9475 9480 9485

Leu Gln Arg Ala Gln Ser Arg Arg Ile Glu Ser Gln Val Phe Glu Ala
9490 9495 9500

Leu Gln Thr Ala Leu Pro Ala Tyr Met Ile Pro Ser Arg Ile Ile Val
9505 9510 9515 9520

Leu Pro Gln Met Pro Thr Asn Ala Asn Gly Lys Val Asp Arg Lys Gln
9525 9530 9535

Leu Ala Arg Arg Ala Gln Val Val Ala Lys Arg Lys Ala Val Ser Ala
9540 9545 9550

Arg Val Ala Pro Arg Asn Asp Thr Glu Ile Val Leu Cys Glu Glu Tyr
9555 9560 9565

Ala Asp Ile Leu Gly Thr Glu Val Gly Ile Thr Asp Asn Phe Phe Asp
9570 9575 9580

Met Gly Gly His Ser Leu Met Ala Thr Lys Leu Ala Ala Arg Leu Ser
9585 9590 9595 9600

Arg Arg Leu Asp Thr Arg Val Thr Val Lys Glu Val Phe Asp Lys Pro
9605 9610 9615

Val Leu Ala Asp Leu Ala Ala Ser Ile Glu Gln Gly Ser Thr Pro His
9620 9625 9630

Leu Pro Ile Ala Ser Ser Val Tyr Ser Gly Pro Val Glu Gln Ser Tyr
9635 9640 9645

Ala Gln Gly Arg Leu Trp Phe Leu Asp Gln Phe Asn Leu Asn Ala Thr
9650 9655 9660

Trp Tyr His Met Ser Leu Ala Met Arg Leu Leu Gly Pro Leu Asn Met
9665 9670 9675 9680

Asp Ala Leu Asp Val Ala Leu Arg Ala Leu Glu Gln Arg His Glu Thr
9685 9690 9695

Leu Arg Thr Thr Phe Glu Ala Gln Lys Asp Ile Gly Val Gln Val Val
9700 9705 9710

His Glu Ala Gly Met Lys Arg Leu Lys Val Leu Asp Leu Ser Asp Lys
9715 9720 9725

Asn Glu Lys Glu His Met Ala Val Leu Glu Asn Glu Gln Met Arg Pro
9730 9735 9740

Phe Thr Leu Ala Ser Glu Pro Gly Trp Lys Gly His Leu Ala Arg Leu
9745 9750 9755 9760

Gly Pro Thr Glu Tyr Ile Leu Ser Leu Val Met His His Met Phe Ser
9765 9770 9775

Asp Gly Trp Ser Val Asp Ile Leu Arg Gln Glu Leu Gly Gln Phe Tyr
9780 9785 9790

Ser Ala Ala Leu Arg Gly Arg Asp Pro Leu Ser Gln Val Lys Pro Leu
9795 9800 9805

Pro Ile Gln Tyr Arg Asp Phe Ala Ala Trp Gln Lys Glu Ala Ala Gln
9810 9815 9820

Val Ala Glu His Glu Arg Gln Leu Ala Tyr Trp Glu Asn Gln Leu Ala
9825 9830 9835 9840

Asp Ser Thr Pro Gly Glu Leu Leu Thr Asp Phe Pro Arg Pro Gln Phe
9845 9850 9855

Leu Ser Gly Lys Ala Gly Val Ile Pro Val Thr Ile Glu Gly Pro Val
9860 9865 9870

Tyr Glu Lys Leu Leu Lys Phe Ser Lys Glu Arg Gln Val Thr Leu Phe
9875 9880 9885

Ser Val Leu Leu Thr Ala Phe Arg Ala Thr His Phe Arg Leu Thr Gly
9890 9895 9900

Ala Glu Asp Ala Thr Ile Gly Thr Pro Ile Ala Asn Arg Asn Arg Pro
9905 9910 9915 9920

Glu Leu Glu His Ile Ile Gly Phe Phe Val Asn Thr Gln Cys Met Arg
9925 9930 9935

Leu Leu Leu Asp Thr Gly Ser Thr Phe Glu Ser Leu Val Gln His Val
9940 9945 9950

Arg Ser Val Ala Thr Asp Ala Tyr Ser Asn Gln Asp Ile Pro Phe Glu
9955 9960 9965

Arg Ile Val Ser Ala Leu Leu Pro Gly Ser Arg Asp Ala Ser Arg Ser
9970 9975 9980

Pro Leu Ile Gln Leu Met Phe Ala Leu His Ser Gln Pro Asp Leu Gly
9985 9990 9995 10000

Asn Ile Thr Leu Glu Gly Leu Glu His Glu Arg Leu Pro Thr Ser Val
10005 10010 10015

Ala Thr Arg Phe Asp Met Glu Phe His Leu Phe Gln Glu Pro Asn Lys
10020 10025 10030

Leu Ser Gly Ser Ile Leu Phe Ala Asp Glu Leu Phe Gln Pro Glu Thr
10035 10040 10045

Ile Asn Ser Val Val Thr Val Phe Gln Glu Ile Leu Arg Arg Gly Leu
10050 10055 10060

Asp Gln Pro Gln Val Ser Ile Ser Thr Met Pro Leu Thr Asp Gly Leu
10065 10070 10075 10080

Ile Asp Leu Glu Lys Leu Gly Leu Leu Glu Ile Glu Ser Ser Asn Phe
10085 10090 10095

Pro Arg Asp Tyr Ser Val Val Asp Val Phe Arg Gln Gln Val Ala Ala
10100 10105 10110

Asn Pro Asn Ala Pro Ala Val Val Asp Ser Glu Thr Ser Met Ser Tyr
10115 10120 10125

Thr Ser Leu Asp Gln Lys Ser Glu Gln Ile Ala Ala Trp Leu His Ala
10130 10135 10140

Gln Gly Leu Arg Pro Glu Ser Leu Ile Cys Val Met Ala Pro Arg Ser
10145 10150 10155 10160

Phe Glu Thr Ile Val Ser Leu Phe Gly Ile Leu Lys Ala Gly Tyr Ala
10165 10170 10175

Tyr Leu Pro Leu Asp Val Asn Ser Pro Ala Ala Arg Ile Gln Pro Ile
10180 10185 10190

Leu Ser Glu Val Glu Gly Lys Arg Leu Val Leu Leu Gly Ser Gly Ile
10195 10200 10205

Asp Met Pro Gln Ser Asp Arg Met Asp Val Glu Thr Ala Arg Ile Gln
10210 10215 10220

Asp Ile Leu Thr Asn Thr Lys Val Glu Arg Ser Asp Pro Met Ser Arg
10225 10230 10235 10240

Pro Ser Ala Thr Ser Leu Ala Tyr Val Ile Phe Thr Ser Gly Ser Thr
10245 10250 10255

Gly Arg Pro Lys Gly Val Met Ile Glu His Arg Asn Ile Leu Arg Leu
10260 10265 10270

Val Lys Gln Ser Asn Val Thr Ser Gln Leu Pro Gln Asp Leu Arg Met
10275 10280 10285

Ala His Ile Ser Asn Leu Ala Phe Asp Ala Ser Ile Trp Glu Ile Phe
10290 10295 10300

Thr Ala Ile Leu Asn Gly Gly Ala Leu Ile Cys Ile Asp Tyr Phe Thr
10305 10310 10315 10320

Leu Leu Asp Ser Gln Ala Leu Arg Thr Thr Phe Glu Lys Ala Arg Val
10325 10330 10335

Asn Ala Thr Leu Phe Ala Pro Ala Leu Leu Lys Glu Cys Leu Asn His
10340 10345 10350

Ala Pro Thr Leu Phe Glu Asp Leu Lys Val Leu Tyr Ile Gly Gly Asp
10355 10360 10365

Arg Leu Asp Ala Thr Asp Ala Ala Lys Ile Gln Ala Leu Val Lys Gly
10370 10375 10380

Thr Val Tyr Asn Ala Tyr Gly Pro Thr Glu Asn Thr Val Met Ser Thr
10385 10390 10395 10400

Ile Tyr Arg Leu Thr Asp Gly Glu Ser Tyr Ala Asn Gly Val Pro Ile
10405 10410 10415

Gly Asn Ala Val Ser Ser Ser Gly Ala Tyr Ile Met Asp Gln Lys Gln
10420 10425 10430

Arg Leu Val Pro Pro Gly Val Met Gly Glu Leu Val Val Ser Gly Asp
10435 10440 10445

Gly Leu Ala Arg Gly Tyr Thr Asn Ser Thr Leu Asn Ala Asp Arg Phe
10450 10455 10460

Val Asp Ile Val Ile Asn Asp Gln Lys Ala Arg Ala Tyr Arg Thr Gly
10465 10470 10475 10480

Asp Arg Thr Arg Tyr Arg Pro Lys Asp Gly Ser Ile Glu Phe Phe Gly
10485 10490 10495

Arg Met Asp Gln Gln Val Lys Ile Arg Gly His Arg Val Glu Pro Ala
10500 10505 10510

Glu Val Glu Gln Ala Met Leu Gly Asn Lys Ala Ile His Asp Ala Ala
10515 10520 10525

Val Val Val Gln Ala Val Asp Gly Gln Glu Thr Glu Met Ile Gly Phe
10530 10535 10540

Val Ser Met Ala Ser Asp Arg Phe Ser Glu Gly Glu Glu Glu Ile Thr
10545 10550 10555 10560

Asn Gln Val Gln Glu Trp Glu Asp His Phe Glu Ser Thr Ala Tyr Ala
10565 10570 10575

Gly Ile Glu Ala Ile Asp Gln Ala Thr Leu Gly Arg Asp Phe Thr Ser
10580 10585 10590

Trp Thr Ser Met Tyr Asn Gly Asn Leu Ile Asp Lys Ala Glu Met Glu
10595 10600 10605

Glu Trp Leu Asp Asp Thr Met Gln Ser Leu Leu Asp Lys Glu Asp Ala
10610 10615 10620

Arg Pro Cys Ala Glu Ile Gly Thr Gly Thr Gly Met Val Leu Phe Asn
10625 10630 10635 10640

Leu Pro Lys Asn Asp Gly Leu Glu Ser Tyr Val Gly Ile Glu Pro Ser
10645 10650 10655

Arg Ser Ala Ala Leu Phe Val Asp Lys Ala Ala Gln Asp Phe Pro Gly
10660 10665 10670

Leu Gln Gly Lys Thr Gln Ile Leu Val Gly Thr Ala Glu Asp Ile Lys
10675 10680 10685

Leu Val Lys Asp Phe His Pro Asp Val Val Val Ile Asn Ser Val Ala
10690 10695 10700

Gln Tyr Phe Pro Ser Arg Ser Tyr Leu Val Gln Ile Ala Ser Glu Leu
10705 10710 10715 10720

Ile His Met Thr Ser Val Lys Thr Ile Phe Phe Gly Asp Met Arg Ser
10725 10730 10735

Trp Ala Thr Asn Arg Asp Phe Leu Val Ser Arg Ala Leu Tyr Thr Leu
10740 10745 10750

Gly Asp Lys Ala Thr Lys Asp Gln Ile Arg Gln Glu Val Ala Arg Leu
10755 10760 10765

Glu Glu Asn Glu Asp Glu Leu Leu Val Asp Pro Ala Phe Phe Thr Ser
10770 10775 10780

[illegible] Val Lys His Val Glu Ile Leu Pro
10785 10790 10795 10800

Lys Arg Met Arg Thr Ser Asn Glu Leu Ser Ser Tyr Arg Tyr Ala Ala
10805 10810 10815

Val Leu His Ile Cys Arg Asp Gly Glu Gly Arg Asn Arg Tyr Gly Arg
10820 10825 10830

Arg Val His Ser Val Glu Glu Asn Ala Trp Ile Asp Phe Ala Ser Ser
10835 10840 10845

Gly Met Asp Arg His Ala Leu Val Gln Met Leu Asp Glu Arg Arg Asp
10850 10855 10860

[illegible] His Ser Asn Thr Ile Asn
10865 10870 10875 10880

Glu Arg His Phe Thr Thr Ser Leu Asp Thr Glu Gly Glu Gly Ile Ala
10885 10890 10895

Gln Asp Ser Leu Asp Gly Ser Ala Trp Gln Ser Ala Thr Lys Ala Met
10900 10905 10910

Ala Ala Arg Cys Pro Cys Leu Ser Val Thr Glu Leu Val Glu Ile Gly
10915 10920 10925

Gln Ala Ala Gly Phe Arg Val Glu Val Ser Trp Ala Arg Gln Arg Ser
10930 10935 10940

Gln His Gly Ala Leu Asp Val Val Phe His His Leu Glu Asp Asp Arg
10945 10950 10955 10960

Val Gly Arg Val Leu Ile Asn Phe Pro Thr Asp Phe Glu Arg Leu Pro
10965 10970 10975

Pro Ser Thr Gly Leu Thr Ser Arg Pro Leu Gln Arg Ile Gln Asn Arg
10980 10985 10990

Arg Phe Glu Ser Gln Ile Arg Glu Gln Leu Gln Thr Leu Leu Pro Pro
10995 11000 11005

Tyr Met Val Pro Ser Arg Ile Val Val Leu Glu Arg Met Pro Leu Asn
11010 11015 11020

Ala Asn Ser Lys Val Asp Arg Lys Glu Leu Ala Arg Lys Ala Arg Thr
11025 11030 11035 11040

Leu Gln Thr Ile Lys Pro Ser Ala Thr Arg Val Ala Pro Arg Asn Asp
11045 11050 11055

Ile Glu Ala Val Leu Cys Asp Glu Phe Gln Ala Val Leu Gly Val Thr
11060 11065 11070

Val Gly Val Met Asp Asn Phe Phe Glu Leu Gly Gly His Ser Leu Met
11075 11080 11085

Ala Thr Lys Leu Ala Ala Arg Leu Ser Arg Arg Leu Asp Thr Arg Val
11090 11095 11100

Ser Val Lys Asp Ile Phe Asn Gln Pro Ile Leu Gln Asp Leu Ala Asp
11105 11110 11115 11120

Val Val Gln Thr Gly Ser Ala Pro His Glu Ala Ile Pro Ser Thr Pro
11125 11130 11135

Tyr Ser Gly Pro Val Glu Gln Ser Phe Ser Gln Gly Arg Leu Trp Phe
11140 11145 11150

Leu Asp Gln Leu Asn Leu Asn Ala Ser Trp Tyr His Met Pro Leu Ala
11155 11160 11165

Ser Arg Leu Arg Gly Pro Leu Arg Ile Glu Ala Leu Gln Ser Ala Leu
11170 11175 11180

Ala Thr Ile Glu Ala Arg His Glu Ser Leu Arg Thr Thr Phe Glu Glu
11185 11190 11195 11200

Gln Asp Gly Val Pro Val Gln Ile Val Arg Ala Ala Arg Asn Lys Gln
11205 11210 11215

Leu Arg Ile Ile Asp Val Ser Gly Thr Glu Asp Ala Tyr Leu Ala Ala
11220 11225 11230

Leu Lys Gln Glu Gln Asp Ala Ala Phe Asp Leu Thr Ala Glu Pro Gly
11235 11240 11245

Trp Arg Val Ala Leu Leu Arg Leu Gly Pro Asp Asp His Val Leu Ser
11250 11255 11260

Ile Val Met His His Ile Ile Ser Asp Gly Trp Ser Val Asp Ile Leu
11265 11270 11275 11280

Arg Gln Glu Leu Gly Gln Leu Tyr Ser Asn Ala Ser Ser Gln Pro Ala
11285 11290 11295

Pro Leu Pro Ile Gln Tyr Arg Asp Phe Ala Ile Trp Gln Lys Gln Asp
11300 11305 11310

Ser Gln Ile Ala Glu His Gln Lys Gln Leu Asn Tyr Trp Lys Arg Gln
11315 11320 11325

Leu Val Asn Ser Lys Pro Ala Glu Leu Leu Ala Asp Phe Thr Arg Pro
11330 11335 11340

Lys Ala Leu Ser Gly Asp Ala Asp Val Ile Pro Ile Glu Ile Asp Asp
11345 11350 11355 11360

Gln Val Tyr Gln Asn Leu Arg Ser Phe Cys Arg Ala Arg His Val Thr
11365 11370 11375

Ser Phe Val Ala Leu Leu Ala Ala Phe Arg Ala Ala His Tyr Arg Leu
11380 11385 11390

Thr Gly Ala Glu Asp Ala Thr Ile Gly Ser Pro Ile Ala Asn Arg Asn
11395 11400 11405

Arg Pro Glu Leu Glu Gly Leu Ile Gly Cys Phe Val Asn Thr Gln Cys
11410 11415 11420

Leu Arg Ile Pro Val Lys Ser Glu Asp Thr Phe Asp Thr Leu Val Lys
11425 11430 11435 11440

Gln Ala Arg Glu Thr Ala Thr Glu Ala Gln Asp Asn Gln Asp Val Pro
11445 11450 11455

Phe Glu Arg Ile Val Ser Ser Met Val Ala Ser Ser Arg Asp Thr Ser
11460 11465 11470

Arg Asn Pro Leu Val Gln Val Met Phe Ala Val His Ser Gln His Asp
11475 11480 11485

Leu Gly Asn Ile Arg Leu Glu Gly Val Glu Gly Lys Pro Val Ser Met
11490 11495 11500

Ala Ala Ser Thr Arg Phe Asp Ala Glu Met His Leu Phe Glu Asp Gln
11505 11510 11515 11520

Gly Met Leu Gly Gly Asn Val Val Phe Ser Lys Asp Leu Phe Glu Ser
11525 11530 11535

Glu Thr Ile Arg Ser Val Val Ala Val Phe Gln Glu Thr Leu Arg Arg
11540 11545 11550

Gly Leu Ala Asn Pro His Ala Asn Leu Ala Thr Leu Pro Leu Thr Asp
11555 11560 11565

Gly Leu Pro Ser Leu Arg Ser Leu Cys Leu Gln Val Asn Gln Pro Asp
11570 11575 11580

Tyr Pro Arg Asp Ala Ser Val Ile Asp Val Phe Arg Glu Gln Val Ala
11585 11590 11595 11600

Ser Ile Pro Lys Ser Ile Ala Val Ile Asp Ala Ser Ser Gln Leu Thr
11605 11610 11615

Tyr Thr Glu Leu Asp Glu Arg Ser Ser Gln Leu Ala Thr Trp Leu Arg
11620 11625 11630

Arg Gln Val Thr Val Pro Glu Glu Leu Val Gly Val Leu Ala Pro Arg
11635 11640 11645

Ser Cys Glu Thr Ile Ile Ala Phe Leu Gly Ile Ile Lys Ala Asn Leu
11650 11655 11660

Ala Tyr Leu Pro Leu Asp Val Asn Ala Pro Ala Gly Arg Ile Glu Thr
11665 11670 11675 11680

Ile Leu Ser Ser Leu Pro Gly Asn Arg Leu Ile Leu Leu Gly Ser Asp
11685 11690 11695

Thr Gln Ala Val Lys Leu His Ala Asn Ser Val Arg Phe Thr Arg Ile
11700 11705 11710

Ser Asp Ala Leu Val Glu Ser Gly Ser Pro Pro Thr Glu Glu Leu Ser
11715 11720 11725

Thr Arg Pro Thr Ala Gln Ser Leu Ala Tyr Val Met Phe Thr Ser Gly
11730 11735 11740

Ser Thr Gly Val Pro Lys Gly Val Met Val Glu His Arg Gly Ile Thr
11745 11750 11755 11760

Arg Leu Val Lys Asn Ser Asn Val Val Ala Lys Gln Pro Ala Ala Ala
11765 11770 11775

Ala Ile Ala His Leu Ser Asn Ile Ala Phe Asp Ala Ser Ser Trp Glu
11780 11785 11790

Ile Tyr Ala Pro Leu Leu Asn Gly Gly Thr Val Val Cys Ile Asp Tyr
11795 11800 11805

Tyr Thr Thr Ile Asp Ile Lys Ala Leu Glu Ala Val Phe Lys Gln His
11810 11815 11820

His Ile Arg Gly Ala Met Leu Pro Pro Ala Leu Leu Lys Gln Cys Leu
11825 11830 11835 11840

Val Ser Ala Pro Thr Met Ile Ser Ser Leu Glu Ile Leu Phe Ala Ala
11845 11850 11855

Gly Asp Arg Leu Ser Ser Gln Asp Ala Ile Leu Ala Arg Arg Ala Val
11860 11865 11870

Gly Ser Gly Val Tyr Asn Ala Tyr Gly Pro Thr Glu Asn Thr Val Leu
11875 11880 11885

Ser Thr Ile His Asn Ile Gly Glu Asn Glu Ala Phe Ser Asn Gly Val
11890 11895 11900

Pro Ile Gly Asn Ala Val Ser Asn Ser Gly Ala Phe Val Met Asp Gln
11905 11910 11915 11920

Asn Gln Gln Leu Val Ser Ala Gly Val Ile Gly Glu Leu Val Val Thr
11925 11930 11935

Gly Asp Gly Leu Ala Arg Gly Tyr Thr Asp Ser Lys Leu Arg Val Asp
11940 11945 11950

Arg Phe Ile Tyr Ile Thr Leu Asp Gly Asn Arg Val Arg Ala Tyr Arg
11955 11960 11965

Thr Gly Asp Arg Val Arg His Arg Pro Lys Asp Gly Gln Ile Glu Phe
11970 11975 11980

Phe Gly Arg Met Asp Gln Gln Ile Lys Ile Arg Gly His Arg Ile Glu
11985 11990 11995 12000

Pro Ala Glu Val Glu Gln Ala Leu Ala Arg Asp Pro Ala Ile Ser Asp
12005 12010 12015

Ser Ala Val Ile Thr Gln Leu Thr Asp Glu Glu Glu Pro Glu Leu Val
12020 12025 12030

Ala Phe Phe Ser Leu Lys Gly Asn Ala Asn Gly Thr Asn Gly Val Asn
12035 12040 12045

Gly Val Ser Asp Gln Glu Lys Ile Asp Gly Asp Glu Gln His Ala Leu
12050 12055 12060

Leu Met Glu Asn Lys Ile Arg His Asn Leu Gln Ala Leu Leu Pro Thr
12065 12070 12075 12080

Tyr Met Ile Pro Ser Arg Ile Ile His Val Asp Gln Leu Pro Val Asn
12085 12090 12095

Ala Asn Gly Lys Ile Asp Arg Asn Glu Leu Ala Val Arg Ala Gln Ala
12100 12105 12110

Thr Pro Arg Thr Ser Ser Val Ser Thr Tyr Val Ala Pro Arg Asn Asp
12115 12120 12125

Ile Glu Thr Ile Ile Cys Lys Glu Phe Ala Asp Ile Leu Ser Val Arg
12130 12135 12140

Val Gly Ile Thr Asp Asn Phe Phe Asp Leu Gly Gly His Ser Leu Ile
12145 12150 12155 12160

Ala Thr Lys Leu Ala Ala Arg Leu Ser Arg Arg Leu Asp Thr Arg Val
12165 12170 12175

Ser Val Arg Asp Val Phe Asp Thr Pro Val Val Gly Gln Leu Ala Ala
12180 12185 12190

Ser Ile Gln Gln Gly Ser Thr Pro His Glu Ala Ile Pro Ala Leu Ser
12195 12200 12205

His Ser Gly Pro Val Gln Gln Ser Phe Ala Gln Gly Arg Leu Trp Phe
12210 12215 12220

Leu Asp Arg Phe Asn Leu Asn Ala Ala Trp Tyr Ile Met Pro Phe Gly
12225 12230 12235 12240

Val Arg Leu Arg Gly Pro Leu Arg Val Asp Ala Leu Gln Thr Ala Leu
12245 12250 12255

Arg Ala Leu Glu Glu Arg His Glu Leu Leu Arg Thr Thr Phe Glu Glu
12260 12265 12270

Gln Asp Gly Val Gly Met Gln Ile Val His Ser Pro Arg Met Arg Asp
12275 12280 12285

Ile Cys Val Val Asp Ile Ser Gly Ala Asn Glu Asp Leu Ala Lys Leu
12290 12295 12300

Lys Glu Glu Gln Gln Ala Pro Phe Asn Leu Ser Thr Glu Val Ala Trp
12305 12310 12315 12320

Arg Val Ala Leu Phe Lys Ala Gly Glu Asn His His Ile Leu Ser Ile
12325 12330 12335

Val Met His His Ile Ile Ser Asp Gly Trp Ser Val Asp Ile Phe Gln
12340 12345 12350

Gln Glu Leu Ala Gln Phe Tyr Ser Val Ala Val Arg Gly His Asp Pro
12355 12360 12365

Leu Ser Gln Val Lys Pro Leu Pro Ile His Tyr Arg Asp Phe Ala Val
12370 12375 12380

Trp Gln Arg Gln Asp Lys Gln Val Ala Val His Glu Ser Gln Leu Gln
12385 12390 12395 12400

Tyr Trp Ile Glu Gln Leu Ala Asp Ser Thr Pro Ala Glu Ile Leu Ser
12405 12410 12415

Asp Phe Asn Arg Pro Glu Val Leu Ser Gly Glu Ala Gly Thr Val Pro
12420 12425 12430

Ile Val Ile Glu Asp Glu Val Tyr Glu Lys Leu Ser Leu Phe Cys Arg
12435 12440 12445

Asn His Gln Val Thr Ser Phe Val Val Leu Leu Ala Ala Phe Arg Val
12450 12455 12460

Ala His Tyr Arg Leu Thr Gly Ala Glu Asp Ala Thr Ile Gly Thr Pro
12465 12470 12475 12480

Ile Ala Asn Arg Asn Arg Pro Glu Leu Glu Asp Leu Ile Gly Phe Phe
12485 12490 12495



Val Asn Thr Gln Cys Met Arg Ile Ala Leu Glu Glu His Asp Asn Phe
12500 12505 12510

Leu Ser Val Val Arg Arg Val Arg Ser Thr Ala Ala Ser Ala Phe Glu
12515 12520 12525

Asn Gln Asp Val Pro Phe Glu Arg Leu Val Ser Ala Leu Leu Pro Gly
12530 12535 12540

Ser Arg Asp Ala Ser Arg Asn Pro Leu Val Gln Leu Met Phe Val Val
12545 12550 12555 12560

His Ser Gln Arg Asn Leu Gly Lys Leu Gln Leu Glu Gly Leu Glu Gly
12565 12570 12575

Glu Pro Thr Pro Tyr Thr Ala Thr Thr Arg Phe Asp Val Glu Phe His
12580 12585 12590

Leu Phe Glu Gln Asp Lys Gly Leu Ala Gly Asn Val Val Phe Ala Ala
12595 12600 12605

Asp Leu Phe Glu Ala Ala Thr Ile Arg Ser Val Val Glu Val Phe His
12610 12615 12620

Asp Gln Pro Asp Ile Ala Ile Ser Thr
12625 12630 12635 12640

Met Pro Leu Val Asp Gly Leu Ala Ala Leu Asn Ser Arg Asn Leu Pro
12645 12650 12655

Ala Val Glu Asp Ile Glu Pro Asp Phe Ala Thr Glu Ala Ser Val Val
12660 12665 12670

Asp Val Phe Gln Thr Gln Val Val Ala Asn Pro Asp Ala Leu Ala Val
12675 12680 12685

Thr Asp Thr Ser Thr Lys Leu Thr Tyr Ala Glu Leu Asp Gln Gln Ser
12690 12695 12700

Asp His Val Ala Ala Trp Leu Ser Lys Gln Lys Leu Pro Ala Glu Ser
12705 12710 12715 12720

Ile Val Val Val Leu Ala Pro Arg Ser Ser Glu Thr Ile Val Ala Cys
12725 12730 12735

Ile Gly Ile Leu Lys Ala Asn Leu Ala Tyr Leu Pro Met Asp Ser Asn
12740 12745 12750

Val Pro Glu Ala Arg Arg Gln Ala Ile Leu Ser Glu Ile Pro Gly Glu
12755 12760 12765

Lys Phe Val Leu Leu Gly Ala Gly Val Pro Ile Pro Asp Asn Lys Thr
12770 12775 12780

Ala Asp Val Arg Met Val Phe Ile Ser Asp Ile Val Ala Ser Lys Thr
12785 12790 12795 12800

Asp Lys Ser Tyr Ser Pro Gly Thr Arg Pro Ser Ala Ser Ser Leu Ala
12805 12810 12815

Tyr Val Ile Phe Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Met
12820 12825 12830

Val Glu His Arg Gly Val Ile Ser Leu Val Lys Gln Asn Ala Ser Arg
12835 12840 12845

Ile Pro Gln Ser Leu Arg Met Ala His Val Ser Asn Leu Ala Phe Asp
12850 12855 12860

Ala Ser Val Trp Glu Ile Phe Thr Thr Leu Leu Asn Gly Gly Thr Leu
12865 12870 12875 12880

Phe Cys Ile Ser Tyr Phe Thr Val Leu Asp Ser Lys Ala Leu Ser Ala
12885 12890 12895

Ala Phe Ser Asp His Arg Ile Asn Ile Thr Leu Leu Pro Pro Ala Leu
12900 12905 12910

Leu Lys Gln Cys Leu Ala Asp Ala Pro Ser Val Leu Ser Ser Leu Glu
12915 12920 12925

Ser Leu Tyr Ile Gly Gly Asp Arg Leu Asp Gly Ala Asp Ala Thr Lys
12930 12935 12940

Val Lys Asp Leu Val Lys Gly Lys Ala Tyr Asn Ala Tyr Gly Pro Thr
12945 12950 12955 12960

Glu Asn Ser Val Met Ser Thr Ile Tyr Thr Ile Glu His Glu Thr Phe
12965 12970 12975

Ala Asn Gly Val Pro Ile Gly Thr Ser Leu Gly Pro Lys Ser Lys Ala
12980 12985 12990

Tyr Ile Met Asp Gln Asp Gln Gln Leu Val Pro Ala Gly Val Met Gly
12995 13000 13005

Glu Leu Val Val Ala Gly Asp Gly Leu Ala Arg Gly Tyr Thr Asp Pro
13010 13015 13020

Ser Leu Asn Thr Gly Arg Phe Ile His Ile Thr Ile Asp Gly Lys Gln
13025 13030 13035 13040

Val Gln Ala Tyr Arg Thr Gly Asp Arg Val Arg Tyr Arg Pro Arg Asp
13045 13050 13055

Tyr Gln Ile Glu Phe Phe Gly Arg Leu Asp Gln Gln Ile Lys Ile Arg
13060 13065 13070

Gly His Arg Ile Glu Pro Ala Glu Val Glu Gln Ala Leu Leu Ser Asp
13075 13080 13085

Ser Ser Ile Asn Asp Ala Val Val Val Ser Ala Gln Asn Lys Glu Gly
13090 13095 13100

Leu Glu Met Val Gly Tyr Ile Thr Thr Gln Ala Ala Gln Ser Val Asp
13105 13110 13115 13120

Lys Glu Glu Ala Ser Asn Lys Val Gln Glu Trp Glu Ala His Phe Asp
13125 13130 13135

Ser Thr Ala Tyr Ala Asn Ile Gly Gly Ile Asp Arg Asp Ala Leu Gly
13140 13145 13150

Gln Asp Phe Leu Ser Trp Thr Ser Met Tyr Asp Gly Ser Leu Ile Pro
13155 13160 13165

Arg Glu Glu Met Gln Glu Trp Leu Asn Asp Thr Met Arg Ser Leu Leu
13170 13175 13180

[illegible] Glu Ile Gly Thr Gly Thr Gly
13185 13190 13195 13200

Met Val Leu Phe Asn Leu Gly Lys Val Glu Gly Leu Gln Ser Tyr Ala
13205 13210 13215

Gly Leu Glu Pro Ser Arg Ser Val Thr Ala Trp Val Asn Lys Ala Ile
13220 13225 13230

Glu Thr Phe Pro Ser Leu Ala Gly Ser Ala Arg Val His Val Gly Thr
13235 13240 13245

Ala Glu Asp Ile Ser Ser Ile Asp Gly Leu Arg Ser Asp Leu Val Val
13250 13255 13260

[illegible] Ser Arg Glu Tyr Leu Ala Glu
13265 13270 13275 13280

Leu Thr Ala Asn Leu Ile Arg Leu Pro Gly Val Lys Arg Ile Phe Phe
13285 13290 13295

Gly Asp Met Arg Thr Tyr Ala Thr Asn Lys Asp Phe Leu Val Ala Arg
13300 13305 13310

Ala Val His Thr Leu Gly Ser Asn Ala Ser Lys Ala Met Val Arg Gln
13315 13320 13325

[illegible] Asp Asp Glu Glu Glu Leu Leu Val Asp Pro
13330 13335 13340

[illegible] Asp Gln Phe Pro Asp Glu Ile Lys His
13345 13350 13355 13360

Val Glu Ile Leu Pro Lys Arg Met Ala Ala Thr Asn Glu Leu Ser Ser
13365 13370 13375

Tyr Arg Tyr Ala Ala Val Ile His Val Gly Gly His Gln Met Pro Asn
13380 13385 13390

Gly Glu Asp Glu Asp Lys Gln Trp Ala Val Lys Asp Ile Asn Pro Lys
13395 13400 13405

Ala Trp Val Asp Phe Ala Gly Thr Arg Met Asp Arg Gln Ala Leu Leu
13410 13415 13420

Gln Leu Leu Gln Asp Arg Gln Arg Gly Asp Asp Val Val Ala Val Ser
13425 13430 13435 13440

Asn Ile Pro Tyr Ser Lys Thr Ile Met Glu Arg His Leu Ser Gln Ser
13445 13450 13455

Leu Asp Asp Asp Glu Asp Gly Thr Ser Ala Val Asp Gly Thr Ala Trp
13460 13465 13470

Ile Ser Arg Thr Gln Ser Arg Ala Lys Glu Cys Pro Ala Leu Ser Val
13475 13480 13485

Ala Asp Leu Ile Glu Ile Gly Lys Gly Ile Gly Phe Glu Val Glu Ala
13490 13495 13500

Ser Trp Ala Arg Gln His Ser Gln Arg Gly Gly Leu Asp Ala Val Phe
13505 13510 13515 13520

His Arg Phe Glu Pro Pro Arg His Ser Gly His Val Met Phe Arg Phe
13525 13530 13535

Pro Thr Glu His Lys Gly Arg Ser Ser Ser Ser Leu Thr Asn Arg Pro
13540 13545 13550

Leu His Leu Leu Gln Ser Arg Arg Leu Glu Ala Lys Val Arg Glu Arg
13555 13560 13565

Leu Gln Ser Leu Leu Pro Pro Tyr Met Ile Pro Ser Arg Ile Thr Leu
13570 13575 13580

Leu Asp Gln Met Pro Leu Thr Ser Asn Gly Lys Val Asp Arg Lys Lys
13585 13590 13595 13600

Leu Ala Arg Gln Ala Arg Val Ile Pro Arg Ser Ala Ala Ser Thr Leu
13605 13610 13615

Asp Phe Val Ala Pro Arg Thr Glu Ile Glu Val Val Leu Cys Glu Glu
13620 13625 13630

Phe Thr Asp Leu Leu Gly Val Lys Val Gly Ile Thr Asp Asn Phe Phe
13635 13640 13645

Glu Leu Gly Gly His Ser Leu Leu Ala Thr Lys Leu Ser Ala Arg Leu
13650 13655 13660

Ser Arg Arg Leu Asp Ala Gly Ile Thr Val Lys Gln Val Phe Asp Gln
13665 13670 13675 13680

Pro Val Leu Ala Asp Leu Ala Ala Ser Ile Leu Gln Gly Ser Ser Arg
13685 13690 13695

His Arg Ser Ile Pro Ser Leu Pro Tyr Glu Gly Pro Val Glu Gln Ser
13700 13705 13710

Phe Ala Gln Gly Arg Leu Trp Phe Leu Asp Gln Phe Asn Ile Asp Ala
13715 13720 13725

Leu Trp Tyr Leu Ile Pro Phe Ala Leu Arg Met Arg Gly Pro Leu Gln
13730 13735 13740

Val Asp Ala Leu Ala Ala Ala Leu Val Ala Leu Glu Glu Arg His Glu
13745 13750 13755 13760

Ser Leu Arg Thr Thr Phe Glu Glu Arg Asp Gly Val Gly Ile Gln Val
13765 13770 13775

Val Gln Pro Leu Arg Thr Thr Lys Asp Ile Arg Ile Ile Asp Val Ser
13780 13785 13790

Gly Met Arg Asp Asp Asp Ala Tyr Leu Glu Pro Leu Gln Lys Glu Gln
13795 13800 13805

Gln Thr Pro Phe Asp Leu Ala Ser Glu Pro Gly Trp Arg Val Ala Leu
13810 13815 13820

Leu Lys Leu Gly Lys Asp Asp His Ile Leu Ser Ile Val Met His His
13825 13830 13835 13840

Ile Ile Ser Asp Gly Trp Ser Thr Glu Val Leu Gln Arg Glu Leu Gly
13845 13850 13855

Gln Phe Tyr Leu Ala Ala Lys Ser Gly Lys Ala Pro Leu Ser Gln Val
13860 13865 13870

Ala Pro Leu Pro Ile Gln Tyr Arg Asp Phe Ala Val Trp Gln Arg Gln
13875 13880 13885

Glu Glu Gln Val Ala Glu Ser Gln Arg Gln Leu Asp Tyr Trp Lys Lys
13890 13895 13900

Gln Leu Ala Asp Ser Ser Pro Ala Glu Leu Leu Ala Asp Tyr Thr Arg
13905 13910 13915 13920

Pro Asn Val Leu Ser Gly Glu Ala Gly Ser Val Ser Phe Val Ile Asn
13925 13930 13935

Asp Ser Val Tyr Lys Ser Leu Val Ser Phe Cys Arg Ser Arg Gln Val
13940 13945 13950

Thr Thr Phe Thr Thr Leu Leu Ala Ala Phe Arg Ala Ala His Tyr Arg
13955 13960 13965

Met Thr Gly Ser Asp Asp Ala Thr Ile Gly Thr Pro Ile Ala Asn Arg
13970 13975 13980

Asn Arg Pro Glu Leu Glu Asn Leu Ile Gly Cys Phe Val Asn Thr Gln
13985 13990 13995 14000

Cys Met Arg Ile Thr Ile Gly Asp Asp Glu Thr Phe Glu Ser Leu Val
14005 14010 14015

Gln Gln Val Arg Ser Thr Thr Ala Thr Ala Phe Glu Asn Gln Asp Val
14020 14025 14030

Pro Phe Glu Arg Ile Val Ser Thr Leu Ser Ala Gly Ser Arg Asp Thr
14035 14040 14045

Ser Arg Asn Pro Leu Val Gln Leu Leu Phe Ala Val His Ser Gln Gln
14050 14055 14060

Gly Leu Gly Arg Ile Gln Leu Asp Gly Val Val Asp Glu Pro Val Leu
14065 14070 14075 14080

Ser Thr Val Ser Thr Arg Phe Asp Leu Glu Phe His Ala Phe Gln Glu
14085 14090 14095

Ala Asp Arg Leu Asn Gly Ser Val Met Phe Ala Thr Asp Leu Phe Gln
14100 14105 14110

Pro Glu Thr Ile Gln Gly Phe Val Ala Val Val Glu Glu Val Leu Gln
14115 14120 14125

Arg Gly Leu Glu Gln Pro Gln Ser Pro Ile Ala Thr Met Pro Leu Ala
14130 14135 14140

Glu Gly Ile Ala Gln Leu Arg Asp Ala Gly Ala Leu Gln Met Pro Lys
14145 14150 14155 14160

Ser Asp Tyr Pro Arg Asn Ala Ser Leu Val Asp Val Phe Gln Gln Gln
14165 14170 14175

Ala Met Ala Ser Pro Ser Thr Val Ala Val Thr Asp Ser Thr Ser Lys
14180 14185 14190

Leu Thr Tyr Ala Glu Leu Asp Arg Leu Ser Asp Gln Ala Ala Ser Tyr
14195 14200 14205

Leu Arg Arg Gln Gln Leu Pro Ala Glu Thr Met Val Ala Val Leu Ala
14210 14215 14220

Pro Arg Ser Cys Glu Thr Ile Ile Ala Phe Leu Ala Ile Leu Lys Ala
14225 14230 14235 14240

Asn Leu Ala Tyr Met Pro Leu Asp Val Asn Thr Pro Ser Ala Arg Met
14245 14250 14255

Glu Ala Ile Ile Ser Ser Val Pro Gly Arg Arg Leu Ile Leu Val Gly
14260 14265 14270

Ser Gly Val Arg His Ala Asp Ile Asn Val Pro Asn Ala Lys Thr Met
14275 14280 14285

Leu Ile Ser Asp Thr Val Thr Gly Thr Asp Ala Ile Gly Thr Pro Glu
14290 14295 14300

Pro Leu Val Val Arg Pro Ser Ala Thr Ser Leu Ala Tyr Val Ile Phe
14305 14310 14315 14320



Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Val Met Val Glu His Arg
14325 14330 14335

Ala Ile Met Arg Leu Val Lys Asp Ser Asn Val Val Thr His Met Pro
14340 14345 14350

Pro Ala Thr Arg Met Ala His Val Thr Asn Ile Ala Phe Asp Val Ser
14355 14360 14365

Leu Phe Glu Met Cys Ala Thr Leu Leu Asn Gly Gly Thr Leu Val Cys
14370 14375 14380

Ile Asp Tyr Leu Thr Leu Leu Asp Ser Thr Met Leu Arg Glu Thr Phe
14385 14390 14395 14400

Glu Arg Glu Gln Val Arg Ala Ala Ile Phe Pro Pro Ala Leu Leu Arg
14405 14410 14415

Gln Cys Leu Val Asn Met Pro Asp Ala Ile Gly Met Leu Glu Ala Val
14420 14425 14430

Tyr Val Ala Gly Asp Arg Phe His Ser Arg Asp Ala Arg Ala Thr Gln
14435 14440 14445

Ala Leu Ala Gly Pro Arg Val Tyr Asn Ala Tyr Gly Pro Thr Glu Asn
14450 14455 14460

Ala Ile Leu Ser Thr Ile Tyr Asn Ile Asp Lys His Asp Pro Tyr Val
14465 14470 14475 14480

Asn Gly Val Pro Ile Gly Ser Ala Val Ser Asn Ser Gly Ala Tyr Val
14485 14490 14495

Met Asp Arg Asn Gln Gln Leu Leu Pro Pro Gly Val Met Gly Glu Leu
14500 14505 14510

Val Val Thr Gly Glu Gly Val Ala Arg Gly Tyr Thr Asp Ala Ser Leu
14515 14520 14525

Asp Thr Asp Arg Phe Val Thr Val Thr Ile Asp Gly Gln Arg Gln Arg
14530 14535 14540

Ala Tyr Arg Thr Gly Asp Arg Val Arg Tyr Arg Pro Lys Gly Phe Gln
14545 14550 14555 14560

Ile Glu Phe Phe Gly Arg Leu Asp Gln Gln Ala Lys Ile Arg Gly His
14565 14570 14575

Arg Val Glu Leu Gly Glu Val Glu His Ala Leu Leu Ser Glu Asn Ser
14580 14585 14590

Val Thr Asp Ala Ala Val Val Leu Arg Thr Met Glu Glu Glu Asp Pro
14595 14600 14605

Gln Leu Val Ala Phe Val Thr Thr Asp His Glu Tyr Arg Ser Gly Ser
14610 14615 14620

Ser Asn Glu Glu Glu Asp Pro Tyr Ala Thr Gln Ala Ala Gly Asp Met
14625 14630 14635 14640

Arg Lys Arg Leu Arg Ser Leu Leu Pro Tyr Tyr Met Val Pro Ser Arg
14645 14650 14655

Val Thr Ile Leu Arg Gln Met Pro Leu Asn Ala Asn Gly Lys Val Asp
14660 14665 14670

Arg Lys Asp Leu Ala Arg Arg Ala Gln Met Thr Pro Thr Ala Ser Ser
14675 14680 14685

Ser Gly Pro Val His Val Ala Pro Arg Asn Glu Thr Glu Ala Ala Ile
14690 14695 14700

Cys Asp Glu Phe Glu Thr Ile Leu Gly Val Lys Val Gly Ile Thr Asp
14705 14710 14715 14720

Asn Phe Phe Glu Leu Gly Gly His Ser Leu Leu Ala Thr Lys Leu Ala
14725 14730 14735

Ala Arg Leu Ser Arg Arg Met Gly Leu Arg Ile Ser Val Lys Asp Leu
14740 14745 14750

Phe Asp Asp Pro Val Pro Val Ser Leu Ala Gly Lys Leu Glu Gln Gln
14755 14760 14765

Gln Gly Phe Ser Gly Glu Asp Glu Ser Ser Thr Val Gly Ile Val Pro
14770 14775 14780

Phe Gln Leu Leu Pro Ala Glu Met Ser Arg Glu Ile Ile Gln Arg Asp
14785 14790 14795 14800

Val Val Pro Gln Ile Glu Asn Gly His Ser Thr Pro Leu Asp Met Tyr
14805 14810 14815

Pro Ala Thr Gln Thr Gln Ile Phe Phe Leu His Asp Lys Ala Thr Gly
14820 14825 14830

His Pro Ala Thr Pro Pro Leu Phe Ser Leu Asp Phe Pro Glu Thr Ala
14835 14840 14845

Asp Cys Arg Arg Leu Ala Ser Ala Cys Ala Ala Leu Val Gln His Phe
14850 14855 14860

Asp Ile Phe Arg Thr Val Phe Val Ser Arg Gly Gly Arg Phe Tyr Gln
14865 14870 14875 14880

Val Val Leu Ala His Leu Asp Val Pro Val Glu Val Ile Glu Thr Glu
14885 14890 14895

Gln Glu Leu Asp Glu Val Ala Leu Ala Leu His Glu Ala Asp Lys Gln
14900 14905 14910

Gln Pro Leu Arg Leu Gly Arg Ala Met Leu Arg Ile Ala Ile Leu Lys
14915 14920 14925

Arg Pro Gly Ala Lys Met Arg Leu Val Leu Arg Met Ser His Ser Leu
14930 14935 14940

Tyr Asp Gly Leu Ser Leu Glu His Ile Val Asn Ala Leu His Ala Leu
14945 14950 14955 14960

Tyr Ser Asp Lys His Leu Ala Gln Ala Pro Lys Phe Gly Leu Tyr Met
14965 14970 14975

His His Met Ala Ser Arg Arg Ala Glu Gly Tyr Asn Phe Trp Arg Ser
14980 14985 14990

Ile Leu Gln Gly Ser Ser Met Thr Ser Leu Lys Arg Ser Val Gly Ala
14995 15000 15005

Leu Glu Ala Met Thr Pro Ser Ala Gly Thr Trp Gln Thr Ser Lys Ser
15010 15015 15020

Ile Arg Ile Pro Pro Ala Ala Leu Lys Asn Gly Ile Thr Gln Ala Thr
15025 15030 15035 15040

Leu Phe Thr Ala Ala Val Ser Leu Leu Leu Ala Lys His Thr Lys Ser
15045 15050 15055

Thr Asp Val Val Phe Gly Arg Val Val Ser Gly Arg Gln Asp Leu Ser
15060 15065 15070

Ile Asn Cys Gln Asp Ile Val Gly Pro Cys Ile Asn Glu Val Pro Val
15075 15080 15085

Arg Val Arg Ile Asp Glu Gly Asp Asp Met Gly Gly Leu Leu Arg Ala
15090 15095 15100

Ile Gln Asp Gln Tyr Thr Ser Ser Phe Arg His Glu Thr Leu Gly Leu
15105 15110 15115 15120

Gln Glu Val Lys Glu Asn Cys Thr Asp Trp Thr Asp Ala Thr Lys Glu
15125 15130 15135

Phe Ser Cys Cys Ile Ala Phe Gln Asn Leu Asn Leu His Pro Glu Ala
15140 15145 15150

Glu Ile Glu Gly Gln Gln Ile Arg Leu Glu Gly Leu Pro Ala Lys Asp
15155 15160 15165

Gln Ala Arg Gln Ala Asn Gly His Ala Pro Asn Gly Thr Asn Gly Thr
15170 15175 15180

Asn Gly Thr Asn Giy Thr Asn Gly Ala Asn Gly Thr Asn GXy Thr Asn 
15185 15190 15195 15200 

Gly Thr Asn Gly Thr His Ala Asn Gly lie Asn Gly Ser Asn Gly Val 
15205 15210 15215 

Asn Gly Arg Asp Ser Asn Val Val Ser Ala Ala Gly Asp Gin Ala Pro 
15220 15225 15230 

Val His Asp Leu Asp He Val Gly He Pro Glu Pro Asp Gly Ser Val 
15235 15240 15245 

Lys He Gly He Gly Ala Ser Arg Gin He Leu Gly Glu Lys Val Val 
15250 15255 15260 

[...] Glu Thr Met Leu Ala Leu Ser Arg 
15265 15270 15275 15280 

Thr 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tolypocladium geodes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGCAACTAT CGGCTCTCCA ATTGCGAACA GAAATCGAGC AGAGCTTGAG GGCCTTATTG 60 

GCTGTTTTGT GAATACTCAG TGTATGAGAC TGCCAGTTAC CGATGAAGAT ACATTCGCCA 120 

ATTTGATTGA CTGTGTACGA GAGACGTCAA CCGAGGCCTT GAGCACCAAG ATATCCTT 178 
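
The declared length of a listed sequence can be checked against its rows. The following is a minimal sketch, not part of the original listing, that joins the grouped rows of SEQ ID NO: 3, drops the running position count at the end of each row, and compares the base count with the stated "LENGTH: 178 base pairs". The helper name `parse_listing_rows` is hypothetical and chosen for illustration only.

```python
import re

def parse_listing_rows(rows):
    """Join grouped sequence rows, dropping the trailing running position count."""
    bases = []
    for row in rows:
        # Keep only tokens made of nucleotide letters; the trailing count is numeric.
        bases.extend(tok for tok in row.split() if re.fullmatch(r"[ACGT]+", tok))
    return "".join(bases)

# Rows of SEQ ID NO: 3 as printed in the listing.
SEQ_ID_3_ROWS = [
    "ATGCAACTAT CGGCTCTCCA ATTGCGAACA GAAATCGAGC AGAGCTTGAG GGCCTTATTG 60",
    "GCTGTTTTGT GAATACTCAG TGTATGAGAC TGCCAGTTAC CGATGAAGAT ACATTCGCCA 120",
    "ATTTGATTGA CTGTGTACGA GAGACGTCAA CCGAGGCCTT GAGCACCAAG ATATCCTT 178",
]

seq3 = parse_listing_rows(SEQ_ID_3_ROWS)
declared_length = 178  # from "(A) LENGTH: 178 base pairs"
print(len(seq3) == declared_length)
```

The same check can be applied to the other nucleotide sequences in the listing to spot rows damaged in transcription.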
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neocosmospora vasinfecta 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACATCGGGGG TATTGATCGC GATGCCCTCG GACAGGACTT CTTATCCTGG ACATCCATGT 60 

ACGACGGCTC ATTGATTCCC CGGGAAGAGA TGCAGGAATG GCTCAGCGAC ACTATGCACT 120 

CACTCCTCGA CAACCAGCCA CCCGGAAGAG TGCTCGAGAT CGGAACTGGT ACCGGTATGG 180 

TGCTTTTCAA TCTCGGCAAG GTTGAGGGAC TACAGAGCTA TGCCGGTCTT GAGCCCTCGC 240 

GCTCCGTCAC TGCCTGGGTT AACAAGGCAA TCGAAACTTT CCCAAGCCTG GCAGGAAGCG 300 

CCCGAGTCCA CGTTGGAACC GCCGAGGATG TCAGCTCCAT CAATGGACTG CGTGCCGATC 360 

TCGTTGTGAT CAACTCGGTC GCCCAATACT TCCCAAGTCG AGAATATCTC GCTGAGCTGA 420 

CGGCCAACTT GATTCGACTG CCCGGCGTCA AGCGTATTTT CTTCGGCGAC ATGAGAACCT 480 

ATGCCACCAA TAAGGACTTC TTGGTGGCAC GAGCAGTCCA TACCCTAGGG TCCAATGCAT 540 

CTAAGGCCAT GGTTCGACAA CAGGTGGCCA AGCTTGAAGA TGACGAGGAA GAGTTGCTTG 600 

TTGACCCTGC CTTCTTCACC AGCCTGAGCG ACCAGTTCCC TGACGAAATC AAGCACGTCG 660 

AGATTCTGCC AAAGAGGATG GCCGCGACCA ACGAACTCAG CTCTTACCGA TATGCTGCTG 720 

TTATTCATGT GGGAGGCCAC GAGATGCCGA ATGGGGAGGA TGAGGATAAG CAATGGGCTG 780 

TCAAGGATAT CGATCCGAAG GCCTGGGTGG ACTTCGCCGG CACGAGGATG GACCGTCAGG 840 

CTCTCTTGCA GCTCCTCCAG GACCGCCAAC GTGGCGATGA CGTTGTTGCC GTCAGTAACA 900 

TCCCATACAG CAAGACCATC ATGGAGCGCC ATCTGTCTCA GTCACTTGAC GATGACGAGG 960 

ACGGCACTTC AGATGCAGAC GGAACGGCCT GGATATCGGC CACTCAATCA CGGGCGAAGG 1020 

AATGCCCTGC TCTCTCAGTG GCCGACCTGA TTGAGATTGG TAAGGGGATC GGCTTCCAAG 1080 

TTGAGACCAG CTGGGCTCGA CAACACTCCC AGCGCGGCGG ACTCGATGCT GTTTTCCACC 1140 

GATTCGAAAA ACCAAGACAC TCGGGTCATG TCATGTTCAG GTTCCCAACT GAACACAAGG 1200 

GGCCGGTCTT CGAGCAGTCT CACGAATCGC CCGCTACACC TGGTTCAGAG CCGCCGGCTG 1260 

GAGGCAAAGG TCCGCGAGCG GCTGCAATCG CTGCTTCCAT CGTACATGAT TCCCTCTCGG 1320 

ATCATGTTGC TCGATCAGAT GCCTCTCACG TCCAACGGCA AGGTGGATCG CAAGAAGCTC 1380 

GCTCGACAAG CCCGGGTCAT CCCAACAATT GCCGCAAGCA CGTTGGACTT TGTGGCGCGC 1440 

ACGCACGGAA ATCGAGGTCG GTTCTCTGCG AAGAATTTAC CGATCTACTA GGCGTCAAGG 1500 

TCGGCATTAC AGACAACTTC TTCGAGTTGG GCGGCCATTC GCTGCTGGCC ACGAAACTGA 1560 

GCGCACGTCT AAGTCGCAGA CTGGACGCCG GTGTCACTGT GAAGCAGATC TTTGACCAGC 1620 

CAGTACTTGC TGATCTTGCT GCTTCTATTC GTCAAGGCTC GTCCCGTCAC AGGTCTATCC 1680 

CGTCTTTACC CTACGAAGGA CCCGTGGAGC AGT 1713 
(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tolypocladium niveum 
(B) STRAIN: ATCC 34921 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CATCAGCAAT CATGGGCAAC AAAGTCTTCT TCGACATTGA GTGGGAGGGC CCCGTCATGC 60 

AGGGTTGCAA GCCTACCTCT ACCGTCAAAG AGCAGTCTGG TCGCATCAAC TTCAAGCTGT 120 

ACGATGACGT CGTCCCCAAG ACCGCCGAGA ACTTCCGCGC TCTCTGCACC GGCGAGAAGG 180 

GCTTCGGCTA CGAGGGCTCG TCCTTCCACC GTATCATCCC CGAGTTCATG CTCCAGGGCG 240 

GCGACTTCAC CCGCGGTAAC GGCACTGGCG GCAAGTCCAT CTACGGCGAG AAGTTTGCCG 300 

ATGAGAACTT CCAGCTGAAG CACGACCGCC CCGGTCTGCT GTCCATGGCT AACGCTGGCC 360 

CCAACACCAA CGGCTCCCAG TTCTTCGTCA CCACCGTCGT CACCTCGTGG CTCAACGGCC 420 

ACCACGTCGT CTTCGGCGAG GTCGCTGACC AGGAGTCCCT GGACGTCGTC AAGGCCCTTG 480 

AGGCCACTGG CTCTGGTAGC GGCGCTGTCA AGTACAACAA GCGCGCCACC ATTGTCAAGT 540 

CTGGCGAGCT GTAAGCTATG GCATCTGTGT ATCTTGCGAT TTCCTGCACC CAATTCGGAC 600 

GGACAAAAGA GGCGCTGCCC ACAGCAAGGA CCTTTGGTTC ACGGGACGGC TTGAA 655 
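
A cDNA sequence such as SEQ ID NO: 5 can be read as protein by translating it with the standard genetic code. The following is a minimal sketch under an assumption the listing does not state: that the first ATG in the sequence opens the reading frame. It translates only the first printed row of SEQ ID NO: 5. The function name `translate_from_first_atg` is hypothetical.

```python
# Standard genetic code, with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate_from_first_atg(dna):
    """Translate from the first ATG until a stop codon or the end of the input."""
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for pos in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)

# First row (positions 1 to 60) of SEQ ID NO: 5, with the grouping spaces removed.
first_row = "CATCAGCAATCATGGGCAACAAAGTCTTCTTCGACATTGAGTGGGAGGGCCCCGTCATGC"
print(translate_from_first_atg(first_row))  # MGNKVFFDIEWEGPVM
```

Whether this frame is the one actually used in the organism is an assumption of the sketch and would need to be confirmed against the description.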



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGATATCGT GAATTGTAAT ACGACTCACT ATA 33 
(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 








GGATCCGTGA ATTGTAATAC GACTCACTAT AGGGCGAATT CGCTCGACGT CACCTAGGAG 60 

ATCAGCCAGC TCCTTGGCCC TGTTCCGCAC GTTGATGCCC TGGTCTTTGC CGTTTGGATC 120 

GATGAAGTGG AACTGGCGCA GCATCTTCAA AAGTGTGATG TGTCCCCGAG CGTCATCAAT 180 

CACACGCTCA GAGCCATGCT TGACGAGGAA CTCGAGCAGT TGCAGAGCCT TGTAGATCTG 240 

GCGCCACTCC TCGGCCGACT TCTCCGTGAA CCGTCGATAT ATCATCGGCA TGATCTCGTT 300 

GAGGGTTTGG CTGGTTCTGT TAGCTGAAGC CGGGCTGTTC AGTCGTCGAA CCGCGTACTA 360 

GTTGAAGGTG CCATTGGCAA TCTCCTGCAT AATACTGGAC GATGCTCCCC ATGGCTCGTT 420 

GTTCGTTGCC TCTCGGACCT AGTACACGGA GTTAGCCACC GTGTTAACAA ACCGTCGCGG 480 

CCGCAGACTA ACCTTGGACT CCATCTCGGT ATAGTTCATA ACAGCTACAT GCCAGGTCAG 540 

CATTGGACGC GCCAGGGCTG AGGTCAGGCC TGGTACCATT TTGCGCCTTT CGGAACCCAG 600 

CCTTGAGGTC GTACAAGGTC AGGTTGGAGA CTGTGTTCTT GATGTCGTTC AAGTCCATTT 660 

TGGCAGATTC GACTTAGCGA GACCGGCCGG GAGCGGCAGA GGAGTTGTCG ATTCAGCACG 720 

AGTCGCTGAT GAGCGATGGT TGTGGTGCAA GTCGATGGTC CGAGGGCGGG TGGTAGAGGT 780 

GCTTGTCGCG ATGGACAGCT GGACTTTCGG GCCGCCAGCG ACACCTACCC GGCCTTGATG 840 

GGTCAGAGGG ATGATCACGT GATATGGGTC GGAGTCGCAT CGTACTTCGT ACCAGCATCA 900 

TCTCCAAGCC AGAGGCAGCA GAGATTATAT GACTGCAAAT GTGAAACGAA ATAAACCGTC 960 

AATATGGTAT TTATGTTGGC AATTGCATGA TGCATCCCGG TGGAATTGAA CTAGAACGTC 1020 

GAGGGCTTGC ATACCAGAGG CTGCGGGTGC ATCGTGGGCA GCGGTACCTG AGACTTCAGG 1080 

CCAGAACGAC TGCTAATAAG CCGCGACGGA GCCAAAACTT TTCCCCTTTC CAGAGGCTCT 1140 

CAGCTTTCGA CTCAGCCATT TGAACTTGCG ACTCAAGCCC GTTCATAACA CTTCATCTCT 1200 

TGTACTTCTA CCGCATTACC TCCTGTACGA ATTGTAATCC CAGGTATGTC TATTTTCCTG 1260 

TTGTTCTCGT CACATGCCCT CCCCAGCATG CGCAATGTCT TTGGACAACG CAGCTCCTCT 1320 

CGACACATCA CAAAGGCTTC ACCCAGCAGA GCACGCGAGA GCCTGCGCGC GACAGCCTGC 1380 

GAGCGACATG CAGCGCTTCC CTGGAAGCCA ACTGCACCAG CCTGGAAAGT TGCGCAGTTT 1440 

GCCAGGGGGC CTCCGTCCCC CAGAATGGAT GGCACTCCTC GGCTTGACCT GGAGCGCTGC 1500 

TCCCGATCAA GCCAGAGCCC GCCGGCGATG GGGACTGGCC GCGCCAGCCT CTGCACATGA 1560 

GTGTGCTGGT TGGCTGGAGG TGGGTGGCCT TTGGCCTCCC AACCAGTCCC CACCATTTGC 1620 

TGGAAGCTGC TGCAGCTGGT CGGAACGCAC CCAAGCCGTT GAGCTCAGCG CTCTGTCGGG 1680 

TCGAGCGCCC ATTGGGGTTC CCGCGAAGGT CCTTTGACTG GGCCGGGGCC ACTCGTCTTG 1740 

CCGGCCAGAG CTGAGCTCGC TGGTCTGGCA GCGACAGCAG CCGGGAGCTC CGTTGTCTAG 1800 

GCGATGAGCG CAGCGGCCAG AGCTCCGGGC CGGATCGGTG ACCTCACAGC CGTGGAAGCT 1860 

CCTGGGCCCC CGAATCAAGG ACCGCAATTC CACGTGACTG GCCGGTTGCT CCCCTTCCGG 1920 

CATTGCCCGC CCCGGTATTA CACCCCTTTG CGCGCCCTGG TTGGTTCAAA GTCCCACCGC 1980 

TAACTTTTAA CCCCTCCAGC AGCCTTCAAA ATGAAGTCAA CGCTCCTTCG ACCCCTCCTA 2040 

CCCCGCTATA AGCTCTGCTC CCCCGGGTCA AGATCTTTCC CTCTTCCACA ACTTGCATCA 2100 

GCTTCCAACA CATTCCGAGC TGCTCGATTC TTCTCCGCAA CATCAGCAAT CATCGAT 2157 



Claims 

1. An isolated DNA sequence which codes for an enzyme having cyclosporin synthetase-like activity. 

2. A DNA sequence according to claim 1 which codes for cyclosporin synthetase or an enzyme that is at 
least 70% homologous thereto and that has cyclosporin synthetase-like activity. 
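
The 70% threshold in claim 2 can be illustrated numerically. The claim does not say how homology is to be measured, so the sketch below rests on a labelled assumption: homology is taken as percent identity over two sequences that have already been aligned to equal length, with gap columns (`-`) skipped. The function name `percent_identity` and the two short peptides are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns; columns containing a gap are skipped."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            continue  # assumption: gap columns do not count towards the total
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0

# Two hypothetical ten-residue peptides differing at two positions.
identity = percent_identity("MGNKVFFDIE", "MGNRVFYDIE")
print(identity, identity >= 70.0)
```

Other conventions, such as counting gap columns as mismatches or scoring conservative substitutions as similar, would give different percentages for the same pair.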

3. A DNA sequence according to claim 1 or claim 2 which codes for an enzyme that has cyclosporin 
synthetase-like activity and in which at least one amino-acid recognition unit is different from that of 
cyclosporin synthetase. 

4. A DNA sequence according to any of claims 1 to 3 which includes the 2890 bp SalI restriction fragment 
containing sequences 40239 to 43129 of Seq Id 1, or a sequence which hybridizes thereto. 

5. A DNA sequence according to any of claims 1 to 3 which includes the 2482 bp SalI restriction fragment 
containing sequences 37781 to 40244 of Seq Id 1, or a sequence which hybridizes thereto. 
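
Claims 4 and 5 identify fragments by the restriction enzyme SalI, which recognises GTCGAC and cuts after the first G. The following is a minimal sketch of how SalI cut positions and the resulting fragment lengths could be located in a linear sequence. The function names and the 20-base toy sequence are hypothetical; the sketch does not use Seq Id 1 and considers only one strand.

```python
SALI_SITE = "GTCGAC"   # SalI recognition sequence
SALI_CUT_OFFSET = 1    # SalI cuts between G and TCGAC (G^TCGAC)

def sali_cut_positions(dna):
    """Return 0-based cut positions for every SalI site in a linear sequence."""
    positions = []
    start = dna.find(SALI_SITE)
    while start != -1:
        positions.append(start + SALI_CUT_OFFSET)
        start = dna.find(SALI_SITE, start + 1)
    return positions

def fragment_lengths(dna, cuts):
    """Return the lengths of the fragments produced by cutting at the given positions."""
    bounds = [0] + list(cuts) + [len(dna)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]

toy = "AAGTCGACTTTTGTCGACCC"  # hypothetical sequence with two SalI sites
cuts = sali_cut_positions(toy)
print(cuts, fragment_lengths(toy, cuts))  # [3, 13] [3, 10, 7]
```

Because GTCGAC is its own reverse complement, scanning one strand is enough to find every site.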


6. A DNA sequence according to claim 1 which includes the sequence of Seq Id 1, or a sequence that 
hybridizes thereto. 

7. A DNA sequence according to claim 1 which codes for an enzyme having an amino acid sequence as given 
in Seq Id 2. 

8. A recombinant vector containing a DNA sequence as defined in any one of claims 1 to 7. 



9. A recombinant vector according to claim 8 which has a restriction map as set out in any one of figures 2 
to 5. 

10. A host cell carrying a vector according to claim 8 or claim 9. 

11. A process for the production of cyclosporin or a cyclosporin derivative, comprising cultivating a host cell 
according to claim 10 and causing the host cell to produce the cyclosporin or cyclosporin derivative. 

12. A method for the production of a cyclosporin derivative, comprising altering the DNA sequence coding for 
cyclosporin synthetase so that the enzyme causes the production of the cyclosporin derivative, placing 
the altered DNA sequence in a vector, transforming a host cell with the vector, and causing the host cell 
to produce the cyclosporin derivative. 

13. A method according to claim 11 in which the DNA sequence coding for cyclosporin synthetase is altered 
by changing the fragments that code for amino acid recognition units. 

FIGURE 1 

FIGURE 2 

FIGURE 3 

FIGURE 4 



[Figure 4: restriction map of a recombinant vector. Legible site labels (position, enzyme): 249 EcoRI; 339 PstI; 358 NcoI; 974 ScaI; 1049 BamHI; 1640 NcoI; 1761 XbaI; 1770 SphI; 2922 ScaI; 4714 BamHI; 4749 EcoRI; 4924 XhoI; 5618 EcoRI; 5722 XhoI; 5745 BglII; 5868 SpeI; 5920 NcoI; 5988 NotI; 6083 Asp718; 7141 PstI; 7663 ClaI. Further labels are illegible in the source.] 




EP 0 578 616 A2 



FIGURE 5 






